STEPHANE GAUBERT AND ZHENG QU



Abstract. The analysis of classical consensus algorithms relies on contraction properties of adjoints of Markov operators, with respect to Hilbert's projective metric or to a related family of seminorms (Hopf's oscillation or Hilbert's seminorm). We generalize these properties to abstract consensus operators over normal cones, which include the unital completely positive maps (Kraus operators) arising in quantum information theory. In particular, we show that the contraction rate of such operators, with respect to the Hopf oscillation seminorm, is given by an analogue of Dobrushin's ergodicity coefficient. We derive from this result a characterization of the contraction rate of a non-linear flow, with respect to Hopf's oscillation seminorm and to Hilbert's projective metric.



1. Introduction 

1.1. Motivation: from Birkhoff's theorem to consensus dynamics. The Hilbert projective metric $d_H$ on the interior of a (closed, convex, and pointed) cone $C$ in a Banach space $X$ can be defined by:

$$d_H(x, y) := \log \inf\Big\{ \frac{\beta}{\alpha} : \alpha, \beta > 0, \ \alpha x \preceq y \preceq \beta x \Big\},$$

where $\preceq$ is the partial order induced by $C$, so that $x \preceq y$ if $y - x \in C$. Birkhoff [Bir57] characterized the contraction ratio with respect to $d_H$ of a linear map $T$ preserving the interior $C^0$ of the cone $C$,

$$\sup_{x, y \in C^0} \frac{d_H(Tx, Ty)}{d_H(x, y)} = \tanh\Big(\frac{\operatorname{diam} T(C^0)}{4}\Big), \qquad \operatorname{diam} T(C^0) := \sup_{x, y \in C^0} d_H(Tx, Ty).$$

This fundamental result, which implies that a linear map sending the cone $C$ into its interior is a strict contraction in Hilbert's metric, can be used to derive the Perron-Frobenius theorem from the Banach contraction mapping theorem, see [Bus73, KP82, EN95] for more information.

Hilbert's projective metric is related to the following family of seminorms. To any point $e \in C^0$ is associated the seminorm

$$x \mapsto \omega(x/e) := \inf\{\beta - \alpha : \alpha e \preceq x \preceq \beta e\}$$

which is sometimes called Hopf's oscillation [Hop63, Bus73] or Hilbert's seminorm [GG04].
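As an illustration, both quantities have closed forms when $X = \mathbb{R}^n$ and $C$ is the non-negative orthant. The following is a minimal sketch under that assumption only (the function names are ours and do not come from the paper): the best constants in $\alpha x \preceq y \preceq \beta x$ are the smallest and largest coordinate ratios, and similarly for $\alpha e \preceq x \preceq \beta e$.

```python
import math

def hilbert_metric(x, y):
    """d_H(x, y) for vectors with positive entries (orthant cone).

    The optimal alpha and beta in alpha*x <= y <= beta*x are the
    minimum and maximum of the ratios y_i / x_i.
    """
    ratios = [yi / xi for xi, yi in zip(x, y)]
    return math.log(max(ratios) / min(ratios))

def hopf_oscillation(x, e):
    """omega(x/e) for an arbitrary vector x and a positive vector e."""
    ratios = [xi / ei for xi, ei in zip(x, e)]
    return max(ratios) - min(ratios)
```

For instance, `hilbert_metric([1, 2], [2, 4])` vanishes because the two vectors are proportional, which reflects the projective nature of the metric.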



Nussbaum [Nus94] showed that $d_H$ is precisely the weak Finsler metric obtained when taking $\omega(\cdot/e)$ to be the infinitesimal distance at point $e$. In other words,

$$d_H(x, y) = \inf_{\gamma} \int_0^1 \omega(\gamma'(s)/\gamma(s)) \, ds$$

where the infimum is taken over piecewise $C^1$ paths $\gamma : [0, 1] \to C^0$ such that $\gamma(0) = x$ and $\gamma(1) = y$. He deduced that the contraction ratio, with respect to Hilbert's projective metric, of a non-linear map $f : U \to C^0$ that is positively homogeneous of degree 1 (i.e. $f(\lambda x) = \lambda f(x)$ for all $\lambda > 0$), can be expressed in terms of the Lipschitz constants of the linear maps $Df(x)$ with respect to a family of Hopf's oscillation seminorms:

$$\sup_{x, y \in U} \frac{d_H(f(x), f(y))}{d_H(x, y)} = \sup_{x \in U} \ \sup_{z \in X, \ \omega(z/x) \neq 0} \frac{\omega(Df(x)z / f(x))}{\omega(z/x)}. \tag{1}$$

Hence, to arrive at an explicit formula for the contraction rate in Hilbert's projective metric of non-linear maps, a basic issue is to determine the Lipschitz constant $\kappa(T, e)$ of a linear map $T$ with respect to Hopf's oscillation seminorm, i.e.,

$$\kappa(T, e) := \sup_{z \in X, \ \omega(z/e) \neq 0} \frac{\omega(T(z)/T(e))}{\omega(z/e)}. \tag{2}$$

The problem of computing the contraction rate (2) also arises in the study of consensus algorithms. A consensus operator is a linear map $T$ which preserves the positive cone $C$ and fixes a unit element $e \in C^0$: $T(e) = e$. A discrete time consensus system can be described by

$$x_{k+1} = T_{k+1}(x_k), \qquad k \in \mathbb{N}, \tag{3}$$

where $T_1, T_2, \ldots$ is a sequence of consensus operators. This model includes in particular the case in which $X = \mathbb{R}^n$, $C = \mathbb{R}^n_+$, $e = (1, \ldots, 1)^\top$ and $T_k(x) = T(x) := Ax$, for all $k$, where $A$ is a stochastic matrix. This has been studied in the field of communication networks, control theory and parallel computation [Hir89, BT89, BGPS06, Mor05, VJAJ05, OT09, AB09]. Consensus operators also arise in non-linear potential theory [Del03]. Other interesting consensus operators are the unital completely positive maps acting on the cone of positive semidefinite matrices, corresponding to quantum channel maps [SSR10, RKW11]. The term noncommutative consensus is coined in [SSR10] for the corresponding class of dynamical systems.

The main concern of consensus theory is the convergence of the orbit $x_k$ to a consensus state, which is nothing but a scalar multiple of the unit element. When $X = \mathbb{R}^n$, $C = \mathbb{R}^n_+$ and $e = (1, \ldots, 1)^\top$, a widely used Lyapunov function for the consensus dynamics, first considered by Tsitsiklis (see [TBA86]), is the "diameter" of the state $x$ defined as

$$\Delta(x) = \max_{1 \leq i, j \leq n} (x_i - x_j),$$

which is precisely Hopf's oscillation seminorm $\omega(x/e)$. It turns out that the latter seminorm can still be considered as a Lyapunov function for a consensus operator $T$, with respect to an arbitrary cone. When $C = \mathbb{R}^n_+$, it is well known that if the contraction ratio of $T$ with respect to the Hopf oscillation seminorm is strictly less than one, and if $T_k = T$ for all $k$, then the orbits of the consensus dynamics converge exponentially to a consensus state. We shall see here that the same remains true in general (Theorem 4.7). For time-dependent consensus systems, a common approach is to bound the contraction ratio of every product of $p$ consecutive operators $T_{i+p} \circ \cdots \circ T_{i+1}$, $i = 1, 2, \ldots$, for a fixed $p$, see for example [Mor05]. Moreover, if $\{T_k : k \geq 1\}$ is a stationary ergodic random process, then the almost sure convergence of the orbits of (3) to a consensus state can be deduced by showing that $\mathbb{E}[\log \|T_{1+p} \cdots T_1\|_H] < 0$ for some $p > 0$, see Bougerol [Bou93]. Hence, in consensus applications, a central issue is again to compute the contraction ratio (2).
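To make the role of the diameter concrete, here is a minimal sketch of the stationary system $x_{k+1} = A x_k$ in $\mathbb{R}^3$. The matrix `A` and the initial state are illustrative values chosen by us, not data from the paper; the script only records the diameter $\Delta(x_k)$ along the orbit.

```python
def diameter(x):
    """Delta(x) = max_i x_i - min_j x_j, Hopf's oscillation w.r.t. (1, ..., 1)."""
    return max(x) - min(x)

def consensus_step(A, x):
    """One step x -> A x for a row stochastic matrix A given as a list of rows."""
    return [sum(a * xk for a, xk in zip(row, x)) for row in A]

# Illustrative row stochastic matrix (each row is non-negative and sums to 1).
A = [[0.50, 0.50, 0.00],
     [0.25, 0.50, 0.25],
     [0.00, 0.50, 0.50]]

x = [1.0, 0.0, -2.0]          # hypothetical initial state
diameters = [diameter(x)]
for _ in range(20):
    x = consensus_step(A, x)
    diameters.append(diameter(x))
```

Along this orbit the recorded diameters never increase, and the coordinates of `x` approach a common value, which is the consensus state.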
1.2. Main results. Our first result characterizes the contraction ratio (2), in a slightly more general setting. We consider a bounded linear map $T$ from a Banach space $X_1$ to a Banach space $X_2$. The latter are equipped with normal cones $C_i \subset X_i$, and unit elements $e_i \in C_i^0$.

Theorem 1.1 (Contraction rate in Hopf's oscillation seminorm). Let $T : X_1 \to X_2$ be a bounded linear map such that $T(e_1) \in \mathbb{R} e_2$. Then

$$\sup_{z \in X_1, \ \omega(z/e_1) \neq 0} \frac{\omega(T(z)/e_2)}{\omega(z/e_1)} = \frac{1}{2} \sup_{\substack{\nu, \pi \in \operatorname{extr} P(e_2) \\ \nu \perp \pi}} \|T^*(\nu) - T^*(\pi)\|_T^\star = \sup_{\substack{\nu, \pi \in \operatorname{extr} P(e_2) \\ \nu \perp \pi}} \ \sup_{x \in [0, e_1]} \langle \nu - \pi, T(x) \rangle.$$

The notation and notions used in this theorem are detailed in Section 3. In particular, we denote by the same symbol $\preceq$ the order relations induced by the two cones $C_i$, $i = 1, 2$; $P(e_2) = \{\mu \in C_2^\star : \langle \mu, e_2 \rangle = 1\}$ denotes the abstract simplex of the dual Banach space $X_2^*$ of $X_2$, where $C_2^\star := \{\mu \in X_2^* : \langle \mu, x \rangle \geq 0, \ \forall x \in C_2\}$ is the dual cone of $C_2$; $\operatorname{extr}$ denotes the extreme points of a set; $\perp$ denotes a certain disjointness relation, which will be seen to generalize the condition that two measures have disjoint supports; $[u, v] := \{x \in X_1 : u \preceq x \preceq v\}$, for all $u, v \in X_1$, and $T^*$ denotes the adjoint of $T$. We shall make use of the following norm, which we call Thompson's norm,

$$\|z\|_T = \inf\{\alpha > 0 : -\alpha e_1 \preceq z \preceq \alpha e_1\}$$

on the space $X_1$, and denote by $\|\cdot\|_T^\star$ the dual norm.

When $C = \mathbb{R}^n_+$, and $T(z) = Az$ for some stochastic matrix $A$, we shall see that the second supremum in Theorem 1.1 is simply

$$\frac{1}{2} \max_{i < j} \sum_{1 \leq k \leq n} |A_{ik} - A_{jk}| = \frac{1}{2} \max_{i < j} \|A_{i\cdot} - A_{j\cdot}\|_{\ell_1},$$

where $A_{i\cdot}$ denotes the $i$th row of the matrix $A$. This quantity is called Doeblin contraction coefficient in the theory of Markov chains; it is known to determine the contraction rate of the adjoint $T^*$ with respect to the $\ell_1$ (or total variation) metric, see [LPW09]. Moreover, the last supremum in Theorem 1.1 can be rewritten more explicitly as

$$1 - \min_{i < j} \sum_{s=1}^{n} \min(A_{is}, A_{js}),$$

a term which is known as Dobrushin's ergodicity coefficient [Dob56]. Note that in general, the norm $\|\cdot\|_T^\star$ can be thought of as an abstract version of the $\ell_1$ or total variation norm.
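The agreement of these expressions can be checked numerically on a small example. The sketch below is ours (function names and the matrix are illustrative assumptions, and at least two rows are assumed): it evaluates the half $\ell_1$ row distance, the Dobrushin form, and a brute-force version of the last supremum of Theorem 1.1, in which $\nu, \pi$ range over coordinate functionals and $x$ over the 0/1 vectors, which are the extreme points of $[0, e]$ in $\mathbb{R}^n$.

```python
from itertools import product

def half_l1_row_distance(A):
    """(1/2) max_{i,j} sum_k |A_ik - A_jk|."""
    return 0.5 * max(
        sum(abs(a - b) for a, b in zip(row_i, row_j))
        for row_i in A for row_j in A
    )

def dobrushin_coefficient(A):
    """1 - min_{i != j} sum_s min(A_is, A_js); assumes at least two rows."""
    n = len(A)
    return 1.0 - min(
        sum(min(a, b) for a, b in zip(A[i], A[j]))
        for i in range(n) for j in range(n) if i != j
    )

def oscillation_over_vertices(A):
    """max over 0/1 vectors x of max_i (Ax)_i - min_j (Ax)_j."""
    n = len(A[0])
    best = 0.0
    for x in product((0.0, 1.0), repeat=n):
        Ax = [sum(a * xk for a, xk in zip(row, x)) for row in A]
        best = max(best, max(Ax) - min(Ax))
    return best

# Illustrative row stochastic matrix, not taken from the paper.
A = [[0.50, 0.50, 0.00],
     [0.25, 0.50, 0.25],
     [0.00, 0.50, 0.50]]
```

The brute-force enumeration is exponential in $n$ and is only meant as a sanity check; the two closed forms cost $O(n^3)$ operations.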

When specializing to a unital completely positive map $T$ on the cone of positive semidefinite matrices, representing a quantum channel [SSR10, RKW11], we shall see that the last supremum in Theorem 1.1 coincides with the following expression, which provides a noncommutative analogue of Dobrushin's ergodicity coefficient (see Corollary 4.6):

$$1 - \min_{\substack{u, v : \, u^* v = 0 \\ u^* u = v^* v = 1}} \ \min_{\substack{X = (x_1, \ldots, x_n) \\ X X^* = I_n}} \ \sum_{i=1}^{n} \min\{u^* T(x_i x_i^*) u, \ v^* T(x_i x_i^*) v\}.$$

Theorem 1.1 shows in particular, when $C = \mathbb{R}^n_+$, that the contraction rate of $T$ with respect to Hopf's oscillation seminorm is the same as the contraction rate of $T^*$ with respect to the $\ell_1$ norm, given by the classical formulas of Doeblin and Dobrushin.

Theorem 1.1 can be thought of as the dual of a result of Reeb, Kastoryano, and Wolf [RKW11, Prop. 12], who gave a closely related formula, without the disjointness restriction (and assuming that the dimension is finite) for the contraction rate of $T^*$ with respect to a certain "base norm", which is the dual of Thompson's norm. Thus, Theorem 1.1 characterizes the contraction rate of $T$, whereas Proposition 12 of [RKW11] characterizes the contraction rate of the adjoint $T^*$. We shall derive here the equality of both contraction rates from general duality considerations, exploiting the observation that Hopf's oscillation seminorm coincides with the quotient norm of Thompson's norm (Lemma 2.7). Then, we deduce Theorem 1.1 from a characterization of the extreme points of the unit ball in the dual space of the quotient normed space (Theorem 2.12). The duality between both approaches is discussed more precisely in Remarks 2.6 and 3.3.
Then, we derive analogous results for flows. In particular, some consensus systems are driven by non-linear ordinary differential equations [Str00, SM03]:

$$\dot{x} = \phi(x)$$

where $\phi(x + \lambda e) = \phi(x)$ for all $\lambda \in \mathbb{R}$. The subclass of maps $\phi$ that yield an order preserving flow is of interest in non-linear potential theory. In this context, the opposite of the map $\phi$ has been called a derivator by Dellacherie [Del03].

For such systems, it is interesting to consider the contraction rate of the flow with respect to Hopf's oscillation seminorm. In particular, if the contraction rate is negative, we deduce an exponential convergence of the orbits of the system to a consensus state. For simplicity, we only consider here a flow on a finite dimensional space (then, a closed, convex, and pointed cone is automatically normal).

Theorem 1.2 (Contraction rate of flows with respect to Hopf's oscillation). The contraction rate $\alpha(U)$ with respect to Hopf's oscillation seminorm, of the flow of the differential equation $\dot{x} = \phi(x)$, restricted to a convex open subset $U \subset X$, is given by:

$$\alpha(U) = \sup_{x \in U} h(D\phi(x)).$$

Here $h(L)$ is defined to be the contraction rate in Hopf's oscillation seminorm of the linear differential equation $\dot{x} = L(x)$. It is given explicitly by (Proposition 5.1):

$$h(L) := - \inf_{\substack{\nu, \pi \in \operatorname{extr} P(e), \ x \in \operatorname{extr}([0, e]) \\ \langle \nu, x \rangle + \langle \pi, e - x \rangle = 0}} \langle \nu, L(x) \rangle + \langle \pi, L(e - x) \rangle.$$
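The formula for $h(L)$ can be evaluated by enumeration in the orthant case. The sketch below is our own specialization, stated as an assumption rather than quoted from the paper: with $X = \mathbb{R}^n$ and $e = (1, \ldots, 1)^\top$, we take $\nu$ and $\pi$ to be the coordinate functionals of indices $i$ and $j$, and $x$ the indicator vector of a subset $I$. The constraint $\langle \nu, x \rangle + \langle \pi, e - x \rangle = 0$ then reads $i \notin I$ and $j \in I$, and the objective becomes $\sum_{k \in I} L_{ik} + \sum_{k \notin I} L_{jk}$.

```python
from itertools import combinations

def hopf_flow_rate(L):
    """Brute-force evaluation of h(L) for an n x n matrix L (list of rows).

    Enumerates i != j and all index subsets I with j in I and i not in I.
    Exponential in n: intended for small illustrative matrices only.
    """
    n = len(L)
    best = float("inf")
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            others = [k for k in range(n) if k != i and k != j]
            for r in range(len(others) + 1):
                for extra in combinations(others, r):
                    subset = set(extra) | {j}
                    value = (sum(L[i][k] for k in subset)
                             + sum(L[j][k] for k in range(n) if k not in subset))
                    best = min(best, value)
    return -best
```

For the two-state generator $L = \begin{pmatrix} -a & a \\ b & -b \end{pmatrix}$ with $a, b \geq 0$, this enumeration returns $-(a + b)$, which matches the decay rate of $x_1 - x_2$ under $\dot{x} = Lx$.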

For illustration we apply this result to some equations in $\mathbb{R}^n$.

Our main results also include analogues of Theorem 1.1 and Theorem 1.2 concerning the contraction rate in Hilbert's projective metric of a non-linear map (Corollary 3.9), as well as a characterization of the contraction rate in the same metric of a non-linear flow (Theorem 7.1).

The paper is organized as follows. In Section 2 we give preliminary results on Thompson's metric, Hilbert's metric, and characterize the extreme points of the dual unit ball. In Section 3 we prove Theorem 1.1 and derive as a corollary the analogous result with respect to Hilbert's projective metric (Corollary 3.9). In Section 4 we apply Theorem 1.1 to discrete time consensus operators. We determine the contraction rate of a linear flow in Hopf's oscillation seminorm in Section 5. In Section 6 we show Theorem 1.2 and discuss some applications to non-linear consensus dynamics. In Section 7 we prove the analogue of Theorem 1.2 with respect to Hilbert's projective metric and show its applications.

2. Preliminaries 

2.1. Thompson's norm and Hopf's oscillation seminorm. We consider a real Banach space $(X, \|\cdot\|)$ and its dual space $X^*$. Let $C \subset X$ be a closed pointed convex cone with non-empty interior $C^0$, i.e., $\alpha C \subset C$ for $\alpha \in \mathbb{R}_+$, $C + C \subset C$ and $C \cap (-C) = \{0\}$. We define the partial order $\preceq$ induced by $C$ on $X$ by

$$x \preceq y \iff y - x \in C.$$

The dual cone of $C$ is:

$$C^\star := \{z \in X^* \mid \langle z, x \rangle \geq 0, \ \forall x \in C\}.$$

Since $C$ is a closed convex cone, it follows from the strong separation theorem that

$$x \in C \iff \langle z, x \rangle \geq 0, \ \forall z \in C^\star. \tag{4}$$

For all $x \preceq y$ we define the order interval:

$$[x, y] := \{z \in X \mid x \preceq z \preceq y\}.$$

For $x \in X$ and $y \in C^0$, following [Nus88], we define

$$M(x/y) := \inf\{t \in \mathbb{R} : x \preceq t y\}, \qquad m(x/y) := \sup\{t \in \mathbb{R} : x \succeq t y\}. \tag{5}$$

Observe that since $y \in C^0$, and since $C$ is closed and pointed, the two sets in (5) are non-empty, closed, and bounded from below and from above, respectively. In particular, $m$ and $M$ take finite values. The difference between $M$ and $m$ is called oscillation [Bus73]:

$$\omega(x/y) := M(x/y) - m(x/y).$$

Let $e$ denote a distinguished element in the interior of $C$, which we shall call a unit. We define

$$\|x\|_T := \max(M(x/e), -m(x/e))$$

which we call Thompson's norm, with respect to the element $e$, and

$$\|x\|_H := \omega(x/e)$$

which we call Hopf's oscillation seminorm with respect to the element $e$.

We assume that the cone is normal, meaning that there exists a constant $K > 0$ such that

$$0 \preceq x \preceq y \implies \|x\| \leq K \|y\|.$$

It is known that under this assumption the two norms $\|\cdot\|$ and $\|\cdot\|_T$ are equivalent, see [Nus94]. Therefore the space $X$ equipped with the norm $\|\cdot\|_T$ is a Banach space. Since Thompson's norm $\|\cdot\|_T$ is always defined with respect to a particular element, we write $(X, e, \|\cdot\|_T)$ instead of $(X, \|\cdot\|_T)$.

By the definition and (4), Thompson's norm with respect to $e$ can be calculated by:

$$\|x\|_T = \sup_{z \in C^\star} \frac{|\langle z, x \rangle|}{\langle z, e \rangle}. \tag{6}$$

Example 2.1. We consider the space $X = \mathbb{R}^n$, the closed convex cone $C = \mathbb{R}^n_+$ and the unit element $e = \mathbf{1} := (1, \ldots, 1)^\top$. It can be checked that Thompson's norm with respect to $\mathbf{1}$ is nothing but the sup norm

$$\|x\|_T = \max_i |x_i| = \|x\|_\infty,$$

whereas Hopf's oscillation seminorm with respect to $\mathbf{1}$ is the so called diameter:

$$\|x\|_H = \max_{1 \leq i, j \leq n} (x_i - x_j) = \Delta(x).$$

Example 2.2. Let $X = S_n$, the space of Hermitian matrices of dimension $n$ and $C = S_n^+ \subset S_n$, the cone of positive semidefinite matrices. Consider the unit element $e = I_n$, the identity matrix of dimension $n$. Then Thompson's norm with respect to $I_n$ is nothing but the sup norm of the spectrum of $X$, i.e.,

$$\|X\|_T = \max_{1 \leq i \leq n} |\lambda_i(X)| = \|\lambda(X)\|_\infty,$$

where $\lambda(X) := (\lambda_1(X), \ldots, \lambda_n(X))$, $\lambda_1(X) \leq \ldots \leq \lambda_n(X)$, is the vector of ordered eigenvalues of $X$, counted with multiplicities, whereas Hopf's oscillation seminorm with respect to $I_n$ is the diameter of the spectrum:

$$\|X\|_H = \max_{1 \leq i, j \leq n} (\lambda_i(X) - \lambda_j(X)) = \Delta(\lambda(X)).$$

2.2. Simplex in the dual space and dual unit ball. We denote by $(X^*, e, \|\cdot\|_T^\star)$ the dual normed space of $(X, e, \|\cdot\|_T)$ where the dual norm $\|\cdot\|_T^\star$ of a continuous linear functional $z \in X^*$ is defined by:

$$\|z\|_T^\star := \sup_{\|x\|_T = 1} \langle z, x \rangle.$$

We define:

$$P(e) := \{\mu \in C^\star \mid \langle \mu, e \rangle = 1\} \tag{7}$$

the simplex with respect to $e$ of the dual Banach space $(X^*, e, \|\cdot\|_T^\star)$.

Remark 2.3. When $X = \mathbb{R}^n$, $C = \mathbb{R}^n_+$ and $e = \mathbf{1}$ (Example 2.1), the dual space $X^*$ is $X = \mathbb{R}^n$ itself and the dual norm $\|\cdot\|_T^\star$ is the $\ell_1$ norm:

$$\|x\|_T^\star = \sum_i |x_i| = \|x\|_1.$$

The simplex $P(\mathbf{1})$ defined in (7) is the simplex in $\mathbb{R}^n$ in the usual sense:

$$P(\mathbf{1}) = \Big\{\nu \in \mathbb{R}^n_+ : \sum_i \nu_i = 1\Big\},$$

i.e., the set of probability measures on the discrete space $\{1, \ldots, n\}$.

Remark 2.4. In the case of $X = S_n$, $C = S_n^+$ and $e = I_n$ (Example 2.2), the dual space $X^*$ is $X = S_n$ itself and the dual norm $\|\cdot\|_T^\star$ is the trace norm:

$$\|X\|_T^\star = \sum_{1 \leq i \leq n} |\lambda_i(X)| = \|X\|_1, \qquad X \in S_n.$$

The simplex $P(I_n)$ defined in (7) is the set of positive semidefinite matrices with trace 1:

$$P(I_n) = \{\rho \in S_n^+ : \operatorname{trace}(\rho) = 1\}.$$

The elements of this set are called density matrices in quantum physics, in which they are thought of as noncommutative analogues of probability measures.

The next lemma relates $P(e)$ and the unit ball $B_T^\star(e)$ of the space $(X^*, e, \|\cdot\|_T^\star)$. We denote by $\operatorname{conv}(S)$ the convex hull of a set $S$.

Lemma 2.5. The unit ball $B_T^\star(e)$ of the space $(X^*, e, \|\cdot\|_T^\star)$ satisfies

$$B_T^\star(e) = \operatorname{conv}(P(e) \cup -P(e)). \tag{8}$$

Proof. For simplicity we write $P$ instead of $P(e)$ and $B_T^\star$ instead of $B_T^\star(e)$ in the proof. It follows from (6) that

$$\|x\|_T = \sup_{\mu \in P} |\langle \mu, x \rangle| = \sup_{\mu \in P \cup -P} \langle \mu, x \rangle. \tag{9}$$

Hence $\|z\|_T^\star \leq 1$ if and only if, for all $x \in X$,

$$\langle z, x \rangle \leq \|x\|_T = \sup_{\mu \in P \cup -P} \langle \mu, x \rangle. \tag{10}$$

By the strong separation theorem [FHH+01, Thm 3.18], if $z$ did not belong to the closed convex hull $\overline{\operatorname{conv}}(P \cup -P)$, the closure being understood in the weak-star topology of $X^*$, there would exist a vector $x \in X$ and a scalar $\gamma$ such that $\langle z, x \rangle > \gamma \geq \langle \mu, x \rangle$, for all $\mu \in P \cup -P$, contradicting (10). Since the reverse inclusion is immediate, we get

$$B_T^\star = \overline{\operatorname{conv}}(P \cup -P).$$

We claim that the latter closure operation can be dispensed with. Indeed, by the Banach-Alaoglu theorem, $B_T^\star$ is weak-star compact. Hence, its subset $P$, which is weak-star closed, is also weak-star compact. If $\mu \in B_T^\star$, by the characterization of $B_T^\star$ above, $\mu$ is a limit, in the weak-star topology, of a net $\{\mu_a = s_a \nu_a - t_a \pi_a : a \in A\}$ with $s_a + t_a = 1$, $s_a, t_a \geq 0$ and $\nu_a, \pi_a \in P$ for $a \in A$. By passing to a subnet we can assume that $\{s_a, t_a : a \in A\}$ converge respectively to $s, t \in [0, 1]$ such that $s + t = 1$ and $\{\nu_a, \pi_a : a \in A\}$ converge respectively in the weak-star topology to $\nu, \pi \in P$. It follows that $\mu = s\nu - t\pi \in \operatorname{conv}(P \cup -P)$. □

Remark 2.6. We make a comparison with the paper [RKW11]. In a finite dimensional setting, Reeb, Kastoryano, and Wolf defined a base $\mathcal{B}$ of a proper cone $\mathcal{K}$ in a vector space $V$ to be a cross section of this cone, i.e., they take $\mathcal{B}$ to be the intersection of the cone $\mathcal{K}$ with a hyperplane given by a linear functional in the interior of the dual cone. So, $V$ corresponds to $X^*$ here, and, since $V$ is of finite dimension, we can identify the dual of $V$ to $X$, and consider the dual cone $C \simeq \mathcal{K}^\star \subset V^* \simeq X$. Modulo this identification, the base $\mathcal{B}$ can be written precisely as $\mathcal{B} = \{\mu \in \mathcal{K} : \langle \mu, e \rangle = 1\}$ for some $e$ in the interior of $\mathcal{K}^\star$, so that the base $\mathcal{B}$ coincides with the simplex $P(e)$ considered here. They defined the base norm of $\mu \in V$ with respect to $\mathcal{B}$ by:

$$\|\mu\|_{\mathcal{B}} = \inf\{\lambda \geq 0 \mid \mu \in \lambda \operatorname{conv}(\mathcal{B} \cup -\mathcal{B})\}.$$

Lemma 2.5 shows that the base norm coincides with the dual norm of Thompson's norm: for $\nu \in X^*$,

$$\|\nu\|_{\mathcal{B}} = \inf\{\lambda \geq 0 \mid \nu \in \lambda \operatorname{conv}(P(e) \cup -P(e))\} = \inf\{\lambda \geq 0 \mid \nu \in \lambda B_T^\star(e)\} = \|\nu\|_T^\star.$$

The set

$$\mathcal{M} = \{x \in V^* \mid 0 \preceq x \preceq e\}$$

is also considered in [RKW11], leading to define the distinguishability norm of $\mu \in V$ by:

$$\|\mu\|_{\mathcal{M}} = \sup_{x \in \mathcal{M}} \langle \mu, 2x - e \rangle. \tag{11}$$

It is shown there that

$$\|\mu\|_{\mathcal{M}} = \|\mu\|_{\mathcal{B}}.$$



2.3. Extreme points of the dual unit ball. We first show that Hopf's oscillation seminorm coincides with the norm on the quotient Banach space of $(X, e, 2\|\cdot\|_T)$ by the closed subspace $\mathbb{R}e$.

Lemma 2.7. For all $x \in X$, we have:

$$\|x\|_H = 2 \inf_{\lambda \in \mathbb{R}} \|x + \lambda e\|_T.$$

Proof. The expression $\|x + \lambda e\|_T = (M(x/e) + \lambda) \vee (-m(x/e) - \lambda)$ is minimal when $M(x/e) + \lambda = -m(x/e) - \lambda$. Substituting the value of $\lambda$ obtained in this way in $\|x + \lambda e\|_T$, we arrive at the announced formula. □
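In the setting of Example 2.1, the proof can be followed numerically. The sketch below is ours and assumes $X = \mathbb{R}^n$ with $e = (1, \ldots, 1)^\top$, so that Thompson's norm is the sup norm and Hopf's seminorm is the diameter; the balancing value of $\lambda$ is $-(\max_i x_i + \min_i x_i)/2$.

```python
def thompson_norm(x):
    """Sup norm: Thompson's norm with respect to e = (1, ..., 1)."""
    return max(abs(v) for v in x)

def hopf_seminorm(x):
    """Diameter: Hopf's oscillation seminorm with respect to e = (1, ..., 1)."""
    return max(x) - min(x)

def twice_quotient_norm(x):
    """2 * min over lambda of ||x + lambda e||_T, using the balancing lambda.

    The minimum is reached when M(x/e) + lambda = -m(x/e) - lambda,
    i.e. lambda = -(max(x) + min(x)) / 2.
    """
    lam = -(max(x) + min(x)) / 2.0
    return 2.0 * thompson_norm([v + lam for v in x])
```

For $x = (3, -1, 0.5)$ the balancing shift is $\lambda = -1$, the shifted vector is $(2, -2, -0.5)$, and both sides of Lemma 2.7 equal 4.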

A standard result [Con90, P.88] of functional analysis shows that if $W$ is a closed subspace of a Banach space $(X, \|\cdot\|)$, then the quotient space $X/W$ is complete. Besides, the dual of the quotient space $X/W$ can be identified isometrically to the space of continuous linear forms on $X$ that vanish on $W$, equipped with the dual norm $\|\cdot\|^*$ of $X^*$. Specializing this result to $W = \mathbb{R}e$, we get:

Lemma 2.8. The quotient normed space $(X/\mathbb{R}e, \|\cdot\|_H)$ is a Banach space. Its dual is $(\mathcal{M}(e), \|\cdot\|_H^\star)$ where

$$\mathcal{M}(e) := \{\mu \in X^* \mid \langle \mu, e \rangle = 0\},$$

and

$$\|\mu\|_H^\star := \frac{1}{2} \|\mu\|_T^\star, \qquad \forall \mu \in \mathcal{M}(e). \tag{12}$$

The above lemma implies that the unit ball of the space $(\mathcal{M}(e), \|\cdot\|_H^\star)$, denoted by $B_H^\star(e)$, satisfies:

$$B_H^\star(e) = 2 B_T^\star(e) \cap \mathcal{M}(e). \tag{13}$$

Remark 2.9. In the case of $X = \mathbb{R}^n$, $C = \mathbb{R}^n_+$ and $e = \mathbf{1}$ (Example 2.1 and Remark 2.3), Lemma 2.8 implies that for any two probability measures $\mu, \nu \in P(\mathbf{1})$, the dual norm $\|\mu - \nu\|_H^\star$ is the total variation distance between $\mu$ and $\nu$:

$$\|\mu - \nu\|_H^\star = \frac{1}{2} \|\mu - \nu\|_1 = \|\mu - \nu\|_{TV}.$$

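Before giving a representation of the extreme points of $B_H^\star(e)$, we define the disjointness relation $\perp$ on $P(e)$.

Definition 2.10. For all $\nu, \pi \in P(e)$, we say that $\nu$ and $\pi$ are disjoint, denoted by $\nu \perp \pi$, if

$$\mu = \frac{\nu + \pi}{2}$$

for all $\mu \in P(e)$ such that $\mu \succeq \frac{\nu}{2}$ and $\mu \succeq \frac{\pi}{2}$.

In particular, we remark the following property:

Lemma 2.11. Let $\nu, \pi \in P(e)$. The following assertions are equivalent:

(a) $\nu \perp \pi$.

(b) The only elements $\rho, \sigma \in P(e)$ such that

$$\nu - \pi = \rho - \sigma$$

are $\rho = \nu$ and $\sigma = \pi$.

Proof. (a) ⇒ (b): Let $\rho, \sigma \in P(e)$ be such that

$$\nu - \pi = \rho - \sigma.$$

Then it is immediate that

$$\nu + \sigma = \pi + \rho.$$

Let $\mu = \frac{\nu + \sigma}{2} = \frac{\pi + \rho}{2}$. Then $\mu \in P(e)$, $\mu \succeq \frac{\nu}{2}$ and $\mu \succeq \frac{\pi}{2}$. Since we assumed $\nu \perp \pi$, we obtain that $\mu = \frac{\nu + \pi}{2}$. It follows that $\rho = \nu$ and $\sigma = \pi$.

(b) ⇒ (a): Let $\mu \in P(e)$ be such that $\mu \succeq \frac{\nu}{2}$ and $\mu \succeq \frac{\pi}{2}$. Then

$$\nu - \pi = (2\mu - \pi) - (2\mu - \nu).$$

From (b) we know that $2\mu - \pi = \nu$. □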

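We denote by $\operatorname{extr}(\cdot)$ the set of extreme points of a convex set.

Theorem 2.12. The set of extreme points of $B_H^\star(e)$, denoted by $\operatorname{extr} B_H^\star(e)$, is characterized by:

$$\operatorname{extr} B_H^\star(e) = \{\nu - \pi : \nu, \pi \in \operatorname{extr} P(e), \ \nu \perp \pi\}.$$

Proof. It follows from (8) that every point $\mu \in B_T^\star(e)$ can be written as $\mu = s\nu - t\pi$ with $s + t = 1$, $s, t \geq 0$, $\nu, \pi \in P(e)$. Moreover, if $\mu \in \mathcal{M}(e)$, then $0 = \langle \mu, e \rangle = s \langle \nu, e \rangle - t \langle \pi, e \rangle = s - t$, and so $s = t = \frac{1}{2}$. Thus every $\mu \in B_T^\star(e) \cap \mathcal{M}(e)$ can be written as

$$\mu = \frac{\nu - \pi}{2}, \qquad \nu, \pi \in P(e).$$

Therefore by (13) we proved that

$$B_H^\star(e) = \{\nu - \pi : \nu, \pi \in P(e)\}. \tag{14}$$

Now let $\nu, \pi \in \operatorname{extr} P(e)$ and $\nu \perp \pi$. We are going to prove that $\nu - \pi \in \operatorname{extr} B_H^\star(e)$. Let $\nu_1, \pi_1, \nu_2, \pi_2 \in P(e)$ be such that

$$\nu - \pi = \frac{\nu_1 - \pi_1}{2} + \frac{\nu_2 - \pi_2}{2}.$$

Then

$$\nu - \pi = \frac{\nu_1 + \nu_2}{2} - \frac{\pi_1 + \pi_2}{2}.$$

By Lemma 2.11, the only possibility is $2\nu = \nu_1 + \nu_2$ and $2\pi = \pi_1 + \pi_2$. Since $\nu, \pi \in \operatorname{extr} P(e)$ we obtain that $\nu_1 = \nu_2 = \nu$ and $\pi_1 = \pi_2 = \pi$. Therefore $\nu - \pi \in \operatorname{extr} B_H^\star(e)$.

Now let $\nu, \pi \in P(e)$ be such that $\nu - \pi \in \operatorname{extr} B_H^\star(e)$. Assume by contradiction that $\nu$ is not extreme in $P(e)$ (the case in which $\pi$ is not extreme can be dealt with similarly). Then, we can find $\nu_1, \nu_2 \in P(e)$, $\nu_1 \neq \nu_2$, such that $\nu = \frac{\nu_1 + \nu_2}{2}$. It follows that

$$\nu - \pi = \frac{\nu_1 - \pi}{2} + \frac{\nu_2 - \pi}{2},$$

where $\nu_1 - \pi$, $\nu_2 - \pi$ are distinct elements of $B_H^\star(e)$, which is a contradiction. Next we show that $\nu \perp \pi$. To this end, let $\rho, \sigma \in P(e)$ be such that

$$\nu - \pi = \rho - \sigma.$$

Then

$$\nu - \pi = \frac{\nu - \pi + \rho - \sigma}{2} = \frac{\nu - \sigma}{2} + \frac{\rho - \pi}{2}.$$

If $\sigma \neq \pi$, then $\nu - \sigma \neq \nu - \pi$ and this contradicts the fact that $\nu - \pi$ is extremal. Therefore $\sigma = \pi$ and $\rho = \nu$. From Lemma 2.11, we deduce that $\nu \perp \pi$. □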
Remark 2.13. When $X = \mathbb{R}^n$, $C = \mathbb{R}^n_+$ and $e = \mathbf{1}$ (Example 2.1 and Remark 2.3), the set of extreme points of $P(\mathbf{1})$ is the set of standard basis vectors $\{e_i\}_{i=1,\ldots,n}$. The extreme points are pairwise disjoint.

Remark 2.14. When $X = S_n$, $C = S_n^+$ and $e = I_n$ (Example 2.2 and Remark 2.4), the set of extreme points of $P(I_n)$ is:

$$\operatorname{extr} P(I_n) = \{x x^* : x \in \mathbb{C}^n, \ x^* x = 1\}.$$

Two extreme points $xx^*$ and $yy^*$ are disjoint if and only if $x^* y = 0$. To see this, note that if $x^* y = 0$ then any Hermitian matrix $X$ such that $X \succeq xx^*$ and $X \succeq yy^*$ should satisfy $X \succeq xx^* + yy^*$. Hence by definition $xx^*$ and $yy^*$ are disjoint. Conversely, suppose that $xx^*$ and $yy^*$ are disjoint and consider the spectral decomposition of the matrix $xx^* - yy^*$, i.e., there is $\lambda \leq 1$ and two orthonormal vectors $u, v$ such that $xx^* - yy^* = \lambda(uu^* - vv^*)$. It follows that $xx^* - yy^* = uu^* - ((1 - \lambda) uu^* + \lambda vv^*)$. By Lemma 2.11, the only possibility is $yy^* = (1 - \lambda) uu^* + \lambda vv^*$ and $xx^* = uu^*$, thus $\lambda = 1$, $u = x$ and $v = y$. Therefore $x^* y = 0$.

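3. The operator norm induced by Hopf's oscillation

Consider two real Banach spaces $X_1$ and $X_2$. Let $C_1 \subset X_1$ and $C_2 \subset X_2$ be respectively two closed pointed convex normal cones with non-empty interiors $C_1^0$ and $C_2^0$. Let $e_1 \in C_1^0$ and $e_2 \in C_2^0$. Then, we know from Section 2 that the two quotient spaces $(X_1/\mathbb{R}e_1, \|\cdot\|_H)$ and $(X_2/\mathbb{R}e_2, \|\cdot\|_H)$ equipped with the Hopf's oscillation seminorms associated respectively to $e_1$ and $e_2$ are Banach spaces. The dual spaces of $(X_1/\mathbb{R}e_1, \|\cdot\|_H)$ and $(X_2/\mathbb{R}e_2, \|\cdot\|_H)$ are respectively the spaces $(\mathcal{M}(e_1), \|\cdot\|_H^\star)$ and $(\mathcal{M}(e_2), \|\cdot\|_H^\star)$ (see Lemma 2.8).

Let $T$ denote a continuous linear map from $(X_1/\mathbb{R}e_1, \|\cdot\|_H)$ to $(X_2/\mathbb{R}e_2, \|\cdot\|_H)$. The operator norm of $T$, denoted by $\|T\|_H$, is given by:

$$\|T\|_H := \sup_{\|x\|_H = 1} \|T(x)\|_H.$$

The adjoint operator $T^* : (\mathcal{M}(e_2), \|\cdot\|_H^\star) \to (\mathcal{M}(e_1), \|\cdot\|_H^\star)$ of $T$ is by definition:

$$\langle T^*(\mu), x \rangle = \langle \mu, T(x) \rangle, \qquad \forall \mu \in \mathcal{M}(e_2), \ x \in X_1.$$

The operator norm of $T^*$, denoted by $\|T^*\|_H^\star$, is then:

$$\|T^*\|_H^\star := \sup_{\|\mu\|_H^\star = 1} \|T^*(\mu)\|_H^\star.$$

A classical duality result (see [AB99, § 6.8]) shows that an operator and its adjoint have the same operator norm. In particular,

$$\|T\|_H = \|T^*\|_H^\star.$$

Theorem 3.1. Let $T : X_1 \to X_2$ be a bounded linear map such that $T(e_1) \in \mathbb{R}e_2$. Then,

$$\|T\|_H = \frac{1}{2} \sup_{\nu, \pi \in P(e_2)} \|T^*(\nu) - T^*(\pi)\|_T^\star = \sup_{\nu, \pi \in P(e_2)} \ \sup_{x \in [0, e_1]} \langle \nu - \pi, T(x) \rangle.$$

Moreover, the supremum can be restricted to the set of extreme points:

$$\|T\|_H = \frac{1}{2} \sup_{\substack{\nu, \pi \in \operatorname{extr} P(e_2) \\ \nu \perp \pi}} \|T^*(\nu) - T^*(\pi)\|_T^\star = \sup_{\substack{\nu, \pi \in \operatorname{extr} P(e_2) \\ \nu \perp \pi}} \ \sup_{x \in [0, e_1]} \langle \nu - \pi, T(x) \rangle. \tag{15}$$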
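Proof. We already noted that $\|T\|_H = \|T^*\|_H^\star$. Moreover,

$$\|T^*\|_H^\star = \sup_{\mu \in B_H^\star(e_2)} \|T^*(\mu)\|_H^\star.$$

By the characterization of $B_H^\star(e_2)$ in (14) and the characterization of the norm $\|\cdot\|_H^\star$ in Lemma 2.8, we get

$$\sup_{\mu \in B_H^\star(e_2)} \|T^*(\mu)\|_H^\star = \sup_{\nu, \pi \in P(e_2)} \|T^*(\nu) - T^*(\pi)\|_H^\star = \frac{1}{2} \sup_{\nu, \pi \in P(e_2)} \|T^*(\nu) - T^*(\pi)\|_T^\star.$$

For the second equality, note that

$$\|T^*(\nu) - T^*(\pi)\|_T^\star = \sup_{x \in [0, e_1]} \langle T^*(\nu) - T^*(\pi), 2x - e_1 \rangle = 2 \sup_{x \in [0, e_1]} \langle T^*(\nu) - T^*(\pi), x \rangle,$$

where the last step uses $\langle T^*(\nu) - T^*(\pi), e_1 \rangle = 0$, which holds because $T^*(\nu) - T^*(\pi) \in \mathcal{M}(e_1)$.

We next show that the supremum can be restricted to the set of extreme points. By the Banach-Alaoglu theorem, $B_H^\star(e_2)$ is weak-star compact, and it is obviously convex. The dual space $X_2^*$ endowed with the weak-star topology is a locally convex topological space. Thus by the Krein-Milman theorem, the unit ball $B_H^\star(e_2)$, which is a compact convex set in $X_2^*$ with respect to the weak-star topology, is the closed convex hull of its extreme points. So every element $\rho$ of $B_H^\star(e_2)$ is the limit of a net $(\rho_a)_a$ of elements of $\operatorname{conv} \operatorname{extr} B_H^\star(e_2)$. Observe now that the function

$$\varphi : \mu \mapsto \|T^*(\mu)\|_H^\star = \sup_{x \in [0, e_1]} \langle T^*(\mu), x \rangle = \sup_{x \in [0, e_1]} \langle \mu, T(x) \rangle,$$

which is a sup of weak-star continuous maps, is convex and weak-star lower semi-continuous. This implies that $\varphi(\rho) \leq \liminf_a \varphi(\rho_a) \leq \sup_{\mu \in \operatorname{conv} \operatorname{extr} B_H^\star(e_2)} \varphi(\mu) = \sup_{\mu \in \operatorname{extr} B_H^\star(e_2)} \varphi(\mu)$. Using the characterization of the extreme points in Theorem 2.12, we get:

$$\sup_{\mu \in B_H^\star(e_2)} \|T^*(\mu)\|_H^\star = \sup_{\mu \in \operatorname{extr} B_H^\star(e_2)} \|T^*(\mu)\|_H^\star = \sup_{\substack{\nu, \pi \in \operatorname{extr} P(e_2) \\ \nu \perp \pi}} \|T^*(\nu) - T^*(\pi)\|_H^\star. \qquad \square$$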

Remark 3.2. When $X_1$ is of finite dimension, the set $[0, e_1]$ is the convex hull of the set of its extreme points, hence, the supremum over the variable $x \in [0, e_1]$ in (15) is attained at an extreme point. Similarly, if $X_2$ is of finite dimension, the suprema over $(\nu, \pi)$ in the same equation are also attained, because the map $\varphi$ in the proof of the previous theorem, which is a supremum of an equi-Lipschitz family of maps, is continuous (in fact, Lipschitz).

Remark 3.3. Theorem 3.1 should be compared with Proposition 12 of [RKW11] which can be stated as follows.

Proposition 3.4 (Proposition 12 in [RKW11]). Let $L : V \to V'$ be a linear map and let $\mathcal{B} \subset V$ and $\mathcal{B}' \subset V'$ be bases. Then

$$\sup_{v_1 \neq v_2 \in \mathcal{B}} \frac{\|L(v_1) - L(v_2)\|_{\mathcal{B}'}}{\|v_1 - v_2\|_{\mathcal{B}}} = \frac{1}{2} \sup_{v_1, v_2 \in \operatorname{extr} \mathcal{B}} \|L(v_1) - L(v_2)\|_{\mathcal{B}'}. \tag{16}$$

The first term in (16) is called the contraction ratio of the linear map $L$, with respect to base norms. One important application of this proposition concerns the base preserving maps $L$ such that $L(\mathcal{B}) \subset \mathcal{B}'$. Let us translate this proposition in the present setting. Consider a linear map $T : X_1/\mathbb{R}e_1 \to X_2/\mathbb{R}e_2$. Then $T^*$, with $T^*(P(e_2)) \subset P(e_1)$, is a base preserving linear map and so, Proposition 12 of [RKW11] shows that:

$$\sup_{\substack{\nu, \pi \in P(e_2) \\ \nu \neq \pi}} \frac{\|T^*(\nu) - T^*(\pi)\|_T^\star}{\|\nu - \pi\|_T^\star} = \frac{1}{2} \sup_{\nu, \pi \in \operatorname{extr} P(e_2)} \|T^*(\nu) - T^*(\pi)\|_T^\star. \tag{17}$$

Hence, by comparison with [RKW11], the additional information here is the equality between the contraction ratio in Hopf's oscillation seminorm of a unit preserving linear map, and the contraction ratio with respect to the base norms of the dual base preserving map. The latter is the primary object of interest in quantum information theory whereas the former is of interest in the control/consensus literature. We also proved that the supremum in (17) can be restricted to pairs of disjoint extreme points $\nu, \pi$. Finally, the expression of the contraction rate as the last supremum in Theorem 3.1 leads here to an abstract version of Dobrushin's ergodic coefficient, see Eqn (20) and Corollary 4.6 below.

Recall that Hilbert's projective metric between two elements $x, y \in C^0$ is defined as:

$$d_H(x, y) = \log(M(x/y)/m(x/y)).$$

Consider a linear operator $T : X_1 \to X_2$ such that $T(C_1^0) \subset C_2^0$. Following [Bir57, Bus73], we define the projective diameter of $T$ as below:

$$\operatorname{diam} T = \sup\{d_H(T(x), T(y)) : x, y \in C_1^0\}.$$

Birkhoff's contraction formula [Bir57, Bus73] states that:

Theorem 3.5 ([Bir57, Bus73]).

$$\sup_{x, y \in C_1^0} \frac{\omega(T(x)/T(y))}{\omega(x/y)} = \sup_{x, y \in C_1^0} \frac{d_H(T(x), T(y))}{d_H(x, y)} = \tanh\Big(\frac{\operatorname{diam} T}{4}\Big).$$

Following [RKW11], we define the projective diameter of $T^*$ by:

$$\operatorname{diam} T^* = \sup\{d_H(T^*(u), T^*(v)) : u, v \in C_2^\star \setminus \{0\}\}.$$

Note that $\operatorname{diam} T = \operatorname{diam} T^*$. This is because

$$\sup_{x, y \in C_1^0} \frac{M(T(x)/T(y))}{m(T(x)/T(y))} = \sup_{x, y \in C_1^0} \ \sup_{u, v \in C_2^\star \setminus \{0\}} \frac{\langle u, T(x) \rangle \langle v, T(y) \rangle}{\langle u, T(y) \rangle \langle v, T(x) \rangle} = \sup_{u, v \in C_2^\star \setminus \{0\}} \frac{M(T^*(u)/T^*(v))}{m(T^*(u)/T^*(v))}.$$

Corollary 3.6 (Compare with [RKW11]). Let $T : X_1 \to X_2$ be a bounded linear map such that $T(e_1) \in \mathbb{R}e_2$ and $T(C_1^0) \subset C_2^0$, then:

$$\|T^*\|_H^\star = \|T\|_H \leq \tanh\Big(\frac{\operatorname{diam} T}{4}\Big) = \tanh\Big(\frac{\operatorname{diam} T^*}{4}\Big).$$

Proof. It is sufficient to prove the inequality. For this, note that

$$\|T\|_H = \sup_{x \in X_1} \frac{\omega(T(x)/e_2)}{\omega(x/e_1)} = \sup_{x \in C_1^0} \frac{\omega(T(x)/e_2)}{\omega(x/e_1)}.$$

Then we apply Birkhoff's contraction formula. □

Remark 3.7. Reeb et al. [RKW11] showed in a different way that

$$\|T^*\|_H^\star \leq \tanh\Big(\frac{\operatorname{diam} T^*}{4}\Big).$$

The proof above shows that as soon as the duality formula $\|T^*\|_H^\star = \|T\|_H$ has been obtained, the latter inequality follows from Birkhoff's contraction formula.
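The inequality of Corollary 3.6 can be observed on an entrywise positive stochastic matrix. The sketch below is ours and relies on an assumption not stated in the text: for a matrix $A$ with positive entries acting on the orthant, the projective diameter has the standard closed form $\max_{i,j,k,l} \log \frac{A_{ik} A_{jl}}{A_{jk} A_{il}}$. The matrix values are illustrative.

```python
import math

def projective_diameter(A):
    """Projective diameter of an entrywise positive matrix (orthant cone).

    Uses the classical closed form max log(A_ik * A_jl / (A_jk * A_il)),
    taken here as an assumption; it is not derived in the text.
    """
    rows, cols = range(len(A)), range(len(A[0]))
    return max(
        math.log(A[i][k] * A[j][l] / (A[j][k] * A[i][l]))
        for i in rows for j in rows for k in cols for l in cols
    )

def birkhoff_bound(A):
    """tanh(diam(A) / 4), the Birkhoff contraction ratio."""
    return math.tanh(projective_diameter(A) / 4.0)

def hopf_operator_norm(A):
    """||T||_H for T(x) = A x, computed as 1 - min_{i != j} sum_s min(A_is, A_js)."""
    n = len(A)
    return 1.0 - min(
        sum(min(a, b) for a, b in zip(A[i], A[j]))
        for i in range(n) for j in range(n) if i != j
    )

A = [[0.9, 0.1],
     [0.2, 0.8]]   # illustrative positive row stochastic matrix
```

Here the projective diameter is $\log 36$, so the Birkhoff bound is $\tanh(\log(36)/4) = 5/7$, while the contraction ratio in Hopf's oscillation seminorm is $0.7$: the inequality holds and is strict for this matrix.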
Nussbaum [Nus94] showed that the Lipschitz constant in Hilbert's projective metric of a non-linear map is determined by the operator norm of its derivative with respect to Hopf's oscillation seminorm. We use this result to deduce a characterization of the contraction rate of non-linear maps in Hilbert's metric. We first quote the result of [Nus94] which we shall use.

Theorem 3.8 (Coro 2.1, [Nus94]). Let $U \subset C^0$ be a convex open set such that $tU \subset U$ for all $t > 0$. Let $f : U \to C^0$ be a continuously differentiable map such that $\omega(f(x)/f(y)) = 0$ whenever $x, y \in U$ and $\omega(x/y) = 0$. For each $x \in U$ define $\lambda(x)$, $\lambda_0$ and $k_0$ by:

$$\lambda(x) := \inf\{c > 0 : \omega(Df(x)v / f(x)) \leq c \, \omega(v/x) \text{ for all } v \in X\},$$
$$\lambda_0 := \sup\{\lambda(x) : x \in U\},$$
$$k_0 := \inf\{c > 0 : d_H(f(x), f(y)) \leq c \, d_H(x, y) \text{ for all } x, y \in U\}.$$

Then it follows that $\lambda_0 = k_0$.

A direct corollary of Theorems 3.1 and 3.8 yields the Lipschitz constant in Hilbert's metric of a (non-linear) map.

Corollary 3.9. Let $U$ and $f$ be as in Theorem 3.8. Then:

$$\sup_{x, y \in U} \frac{d_H(f(x), f(y))}{d_H(x, y)} = \sup_{x \in U} \ \sup_{\substack{\nu, \pi \in \operatorname{extr} P(f(x)) \\ \nu \perp \pi}} \ \sup_{z \in [0, x]} \langle \nu - \pi, Df(x) z \rangle.$$

Remark 3.10. This corollary generalizes Corollary 2.1 of [Nus94], which gives a similar characterization in terms of extreme points, when $X = \mathbb{R}^n$ and $C = \mathbb{R}^n_+$. Note that in the finite dimensional case, the suprema over the variable $z$ and over the variables $\nu, \pi$ are attained (see Remark 3.2). Moreover, the supremum over $z$ is attained at an extreme point of $[0, x]$.

4. Application to discrete consensus operators on cones 

A classical result, which goes back to Doeblin and Dobrushin, characterizes the Lipschitz
constant of a Markov matrix acting on the space of measures (i.e., a row stochastic
matrix acting on the left), with respect to the total variation norm (see the discussion in
Remark 4.5 below). The same constant characterizes the contraction ratio with respect
to the "diameter" (Hopf oscillation seminorm) of the consensus system driven by this
Markov matrix (i.e., a row stochastic matrix acting on the right). Consensus operators
on cones extend Markov matrices. In this section, we extend to these abstract operators
a number of known properties of Markov matrices.

A linear map $T : X \to X$ is a consensus operator with respect to a unit vector $e$ in
the interior $C^\circ$ of a closed convex pointed cone $C \subset X$ if it satisfies the two following
properties:

(i) $T$ is positive, i.e., $T(C) \subset C$.

(ii) $T$ preserves the unit element $e$, i.e., $T(e) = e$.

Example 4.1. When $X = \mathbb{R}^n$, $C$ is the standard orthant and $e$ is the standard unit vector
$\mathbf{1}$ (Example 2.1), a linear map $T(x) = Ax$ is a consensus operator if and only if $A$ is a
row stochastic matrix. The operator norm is the contraction rate of the matrix $A$ with
respect to the diameter $\Delta$:

\[ \|T\|_H = \tau(A) := \sup_{x \in \mathbb{R}^n,\ \Delta(x) \neq 0} \frac{\Delta(Ax)}{\Delta(x)}, \]

and the dual operator norm is the Lipschitz constant of $A^T$ on $\mathcal{P}(\mathbf{1})$ with respect to the
total variation distance:

\[ \|T^*\|_H^\star = \delta(A) := \sup_{\mu,\nu \in \mathcal{P}(\mathbf{1}),\ \mu \neq \nu} \frac{\|A^T\mu - A^T\nu\|_{TV}}{\|\mu - \nu\|_{TV}}. \]

The value $\tau(A)$ allows one to bound the convergence rate of the stationary linear consensus
system whose dynamics is given by the matrix $A$ [MDA05, VJAJ05]. The value
$\delta(A)$ is known as the ergodicity coefficient of the Markov chain with transition probability
matrix $A^T$, see [LPW09].

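Example 4.2. When $X = \mathrm{S}_n$, $C = \mathrm{S}_n^+$ and $e = I_n$ (Example 2.2), the linear map $\Phi : \mathrm{S}_n \to \mathrm{S}_n$
defined by

\[ \Phi(X) = \sum_{i=1}^m V_i^* X V_i, \qquad \sum_{i=1}^m V_i^* V_i = I_n \tag{18} \]

is a consensus operator. The dual operator is then given by:

\[ \Psi(X) = \sum_{i=1}^m V_i X V_i^* . \]

Both maps are completely positive. They represent a purely quantum channel [RKW11,
SSR10]. The map $\Phi$ is unital and acts between spaces of operators while the adjoint map
$\Psi$ is trace-preserving and acts between spaces of states (density matrices). The operator
norm of $\Phi : \mathrm{S}_n/\mathbb{R}I_n \to \mathrm{S}_n/\mathbb{R}I_n$ is the contraction rate of the diameter of the spectrum:

\[ \|\Phi\|_H = \sup_{X \in \mathrm{S}_n} \frac{\lambda_{\max}(\Phi(X)) - \lambda_{\min}(\Phi(X))}{\lambda_{\max}(X) - \lambda_{\min}(X)} . \]

The operator norm of the adjoint map $\Psi : \mathcal{P}(I_n) \to \mathcal{P}(I_n)$ is the contraction rate of the
trace distance:

\[ \|\Psi\|_H^\star = \sup_{\rho_1,\rho_2 \in \mathcal{P}(I_n)} \frac{\|\Psi(\rho_1) - \Psi(\rho_2)\|_1}{\|\rho_1 - \rho_2\|_1} . \]

The values $\|\Phi\|_H$ and $\|\Psi\|_H^\star$ are the noncommutative counterparts of $\tau(\cdot)$ and $\delta(\cdot)$.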



A direct application of Theorem 3.1 leads to the following characterization of the operator
norm, which will be seen to extend Dobrushin's formula (see Remark 4.5 below).

Corollary 4.3. Let $T : X \to X$ be a consensus operator with respect to $e$. Then,

\[ \|T\|_H = \|T^*\|_H^\star = 1 - \inf_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \inf_{x \in [0,e]} \langle \pi, T(x) \rangle + \langle \nu, T(e - x) \rangle . \]

Proof. Since $T(e) = e$, we have:

\[ \sup_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \sup_{x \in [0,e]} \langle \nu - \pi, T(x) \rangle = \sup_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \sup_{x \in [0,e]} 1 - \langle \pi, T(x) \rangle - \langle \nu, T(e - x) \rangle . \]

□



Remark 4.4. In the finite dimensional case, as already noted in Remark 3.2, the supremum
over $x$ is reached at $\operatorname{extr}[0, e]$.

Remark 4.5. In the case of a stochastic matrix $A$ (Example 4.1), Corollary 4.3 implies
that:

\[ \tau(A) = \delta(A) = \frac{1}{2} \sup_{i<j} \|A^T e_i - A^T e_j\|_1 . \tag{19} \]

This is a known result in the study of Markov chains [Sen91]. The value $\tau(A)$ is known
under the name of Dobrushin's ergodic coefficient of the stochastic matrix $A$ [Dob56]. It
is explicitly given by:

\[ \tau(A) = 1 - \min_{i<j} \sum_{s=1}^n \min(A_{is}, A_{js}) . \tag{20} \]

Indeed, the characterization of $\tau(A) = \|T\|_H$ given by Corollary 4.3 yields

\[ \tau(A) = 1 - \min_{i<j}\ \min_{I \subset \{1,\dots,n\}} \Big( \sum_{k \in I} A_{ik} + \sum_{k \notin I} A_{jk} \Big), \]

from which (20) follows.

A simple classical situation in which $\tau(A) < 1$ is when there is a Doeblin state, i.e., an
element $j \in \{1, \dots, n\}$ such that $A_{ij} > 0$ holds for all $i \in \{1, \dots, n\}$.
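As an illustration of formulas (19) and (20), the following minimal sketch evaluates Dobrushin's coefficient of a small row stochastic matrix in both ways. The matrix `A` and the function names are hypothetical choices made for this example; the second column of `A` is a Doeblin state.

```python
from itertools import combinations

def tau_min_formula(A):
    """Formula (20): 1 - min_{i<j} sum_s min(A[i][s], A[j][s])."""
    n = len(A)
    return 1.0 - min(
        sum(min(A[i][s], A[j][s]) for s in range(n))
        for i, j in combinations(range(n), 2)
    )

def delta_l1_formula(A):
    """Formula (19): (1/2) max_{i<j} ||A^T e_i - A^T e_j||_1.

    A^T e_i is the i-th row of A, so only rows of A are compared.
    """
    n = len(A)
    return 0.5 * max(
        sum(abs(A[i][s] - A[j][s]) for s in range(n))
        for i, j in combinations(range(n), 2)
    )

# Hypothetical row stochastic matrix; every entry of column 1 is positive.
A = [
    [0.50, 0.50, 0.00],
    [0.25, 0.50, 0.25],
    [0.00, 0.50, 0.50],
]

tau = tau_min_formula(A)
delta = delta_l1_formula(A)
print(tau, delta)
```

Both evaluations agree here because, for two rows summing to one, half the $\ell_1$ distance equals one minus the sum of the entrywise minima.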

Specializing Corollary 4.3 to the case of quantum channels (Example 4.2), we obtain
the noncommutative version of Dobrushin's ergodic coefficient.

Corollary 4.6. Let $\Phi$ be a quantum channel defined in (18). Then,

\[ \|\Phi\|_H = \|\Psi\|_H^\star = 1 - \min_{\substack{u,v:\ u^*v=0 \\ u^*u=v^*v=1}}\ \min_{\substack{X=(x_1,\dots,x_n) \\ XX^*=I_n}}\ \sum_{i=1}^n \min\{u^*\Phi(x_i x_i^*)u,\ v^*\Phi(x_i x_i^*)v\} . \tag{21} \]

Proof. It can be easily checked that

\[ \operatorname{extr}[0, I_n] = \{P \in \mathrm{S}_n : P^2 = P\} . \]

Hence, Corollary 14.31 and Remark 12.141 yield: 

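\[ \begin{aligned}
\|\Phi\|_H = \|\Psi\|_H^\star &= 1 - \min_{\substack{u,v:\ u^*v=0 \\ u^*u=v^*v=1}}\ \min_{P \in \operatorname{extr}[0,I_n]} u^*\Phi(I_n - P)u + v^*\Phi(P)v \\
&= 1 - \min_{\substack{u,v:\ u^*v=0 \\ u^*u=v^*v=1}}\ \min_{\substack{X=(x_1,\dots,x_n) \\ XX^*=I_n}}\ \min_{J \subset \{1,\dots,n\}} \sum_{i \notin J} u^*\Phi(x_i x_i^*)u + \sum_{i \in J} v^*\Phi(x_i x_i^*)v ,
\end{aligned} \]

from which (21) follows. □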

We now make the following basic observations for a consensus operator $T : X \to X$:

\[ M(T(x)/e) \le M(x/e), \qquad m(T(x)/e) \ge m(x/e), \qquad \forall x \in X . \]

It follows that $\|T\|_H \le 1$. The case when $\|T\|_H < 1$ or equivalently $\|T^*\|_H^\star < 1$ is of
special interest, as shown by the following theorem, which states that the iterates of $T$
converge to a rank one projector with a rate bounded by $\|T\|_H$.

Theorem 4.7 (Geometric convergence to consensus). If $\|T\|_H < 1$ or equivalently
$\|T^*\|_H^\star < 1$, then there is $\pi \in \mathcal{P}(e)$ such that for all $x \in X$

\[ \|T^n(x) - \langle \pi, x \rangle e\|_T \le (\|T\|_H)^n \|x\|_H , \]

and for all $\mu \in \mathcal{P}(e)$

\[ \|(T^*)^n(\mu) - \pi\|_H^\star \le (\|T\|_H)^n . \]

Proof. The intersection

\[ \bigcap_n\, [m(T^n(x)/e), M(T^n(x)/e)] \subset \mathbb{R} \]

is nonempty (as a non-increasing intersection of nonempty compact sets), and since
$\|T\|_H < 1$ and

\[ \omega(T^n(x)/e) \le (\|T\|_H)^n \omega(x/e), \]

this intersection must be reduced to a real $\{c(x)\} \subset \mathbb{R}$ depending on $x$, i.e.,

\[ \{c(x)\} = \bigcap_n\, [m(T^n(x)/e), M(T^n(x)/e)] . \]

Thus for all $n \in \mathbb{N}$,

\[ -\omega(T^n(x)/e)\,e \le T^n(x) - c(x)e \le \omega(T^n(x)/e)\,e . \]

Therefore by definition:

\[ \|T^n(x) - c(x)e\|_T \le \omega(T^n(x)/e) . \]

Then we get:

\[ \|T^n(x) - c(x)e\|_T \le (\|T\|_H)^n \|x\|_H . \]

It is immediate that:

\[ c(x)e = \lim_{n \to \infty} T^n(x), \]

from which we deduce that $c : X \to \mathbb{R}$ is a continuous linear functional. Thus there is
$\pi \in X^*$ such that $c(x) = \langle \pi, x \rangle$. Besides it is immediate that $\langle \pi, e \rangle = 1$ and $\pi \in C^*$
because

\[ x \in C \ \Rightarrow\ c(x)e \in C \ \Rightarrow\ c(x) \ge 0 \ \Rightarrow\ \langle \pi, x \rangle \ge 0 . \]

Therefore $\pi \in \mathcal{P}(e)$. Finally for all $\mu \in \mathcal{P}(e)$ and all $x \in X$ we have

\[ \begin{aligned}
\langle (T^*)^n(\mu) - \pi, x \rangle &= \langle \mu, T^n(x) - \langle \pi, x \rangle e \rangle \\
&\le \|\mu\|_T^\star\, \|T^n(x) - \langle \pi, x \rangle e\|_T \\
&\le (\|T\|_H)^n \|x\|_H .
\end{aligned} \]

Hence

\[ \|(T^*)^n(\mu) - \pi\|_H^\star \le (\|T\|_H)^n . \]

□

Remark 4.8. Specializing Theorem 4.7 to the case of $X = \mathbb{R}^n$ (Example 2.1) we obtain
that if $\tau(A) = \delta(A) < 1$, then

\[ \|(A^T)^n \mu - \pi\|_{TV} \le (\delta(A))^n , \qquad \forall \mu \in \mathcal{P}(\mathbf{1}), \]

where $\pi$ is the unique invariant measure of the stochastic matrix $A$. This is a well-known
result in the study of ergodicity properties and mixing times of Markov chains, see for
example [Sen91] and [LPW09].
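The geometric convergence of Theorem 4.7 can be observed numerically in the case $X = \mathbb{R}^n$, where the Thompson norm with respect to $\mathbf{1}$ is the sup norm and $\|x\|_H$ is the diameter $\Delta(x)$. The sketch below is only an illustration under assumptions: the stochastic matrix `A`, its invariant measure `pi` (checked by hand for this particular matrix) and the starting vector `x0` are hypothetical choices.

```python
from itertools import combinations

def matvec(A, x):
    """Matrix-vector product A x for a list-of-rows matrix."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]

def diameter(x):
    """Hopf oscillation of x with respect to the unit vector 1."""
    return max(x) - min(x)

def tau(A):
    """Dobrushin's coefficient: 1 - min_{i<j} sum_s min(A[i][s], A[j][s])."""
    n = len(A)
    return 1.0 - min(
        sum(min(A[i][s], A[j][s]) for s in range(n))
        for i, j in combinations(range(n), 2)
    )

# Hypothetical row stochastic matrix and its invariant measure (pi A = pi).
A = [
    [0.50, 0.50, 0.00],
    [0.25, 0.50, 0.25],
    [0.00, 0.50, 0.50],
]
pi = [0.25, 0.5, 0.25]

x0 = [1.0, 0.0, -2.0]
consensus = sum(p * v for p, v in zip(pi, x0))  # <pi, x0>
rate = tau(A)

x = list(x0)
bound_holds = True
for n in range(21):
    sup_error = max(abs(v - consensus) for v in x)  # ||A^n x0 - <pi,x0> 1||_T
    bound = rate ** n * diameter(x0)                # tau(A)^n ||x0||_H
    if sup_error > bound + 1e-12:
        bound_holds = False
    x = matvec(A, x)

print(rate, consensus, bound_holds)
```

The small additive tolerance only absorbs floating point rounding; the inequality itself is the one stated in Theorem 4.7.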

Remark 4.9. A time-dependent consensus system is described by

\[ x_{k+1} = T_{k+1}(x_k), \qquad k \in \mathbb{N}, \tag{22} \]

where $\{T_k : k \ge 1\}$ is a sequence of consensus operators sharing a common unit element
$e \in C^\circ$. Then if there is an integer $p > 0$ and a constant $\alpha < 1$ such that for all $i \in \mathbb{N}$

\[ \|T_{i+p} \cdots T_{i+1}\|_H \le \alpha , \]

then the same lines of proof of Theorem 4.7 imply the existence of $\pi \in \mathcal{P}(e)$ such that
for all $\{x_k\}$ satisfying (22),

\[ \|x_n - \langle \pi, x_0 \rangle e\|_T \le \alpha^{\lfloor n/p \rfloor} \|x_0\|_H , \qquad n \in \mathbb{N} . \]

Remark 4.10. In the case of $X = \mathbb{R}^n$ and $T_k(x) = A_k x$ where $A_k$ is a stochastic matrix,
Moreau [Mor05] showed that if all the non-zero entries are bounded from below by a
positive constant and if there is $p \in \mathbb{N}$ such that for all $i \in \mathbb{N}$ there is a node connected to
all other nodes in the graph associated to the matrix $A_{i+p} \cdots A_{i+1}$, then the system (22) is
globally uniformly convergent. These two conditions imply exactly that Dobrushin's
ergodic coefficient (20) of $A_{i+p} \cdots A_{i+1}$, which is also the operator norm $\|T_{i+p} \cdots T_{i+1}\|_H$,
is bounded by a constant less than 1.
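To make the role of the window length $p$ concrete, here is a small sketch in which each matrix alone has Dobrushin coefficient 1, while the product over a window of length $p = 2$ is a strict contraction. The two matrices are hypothetical examples chosen for this illustration.

```python
from itertools import combinations

def tau(A):
    """Dobrushin's coefficient: 1 - min_{i<j} sum_s min(A[i][s], A[j][s])."""
    n = len(A)
    return 1.0 - min(
        sum(min(A[i][s], A[j][s]) for s in range(n))
        for i, j in combinations(range(n), 2)
    )

def matmul(A, B):
    """Product A B of two square list-of-rows matrices."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Hypothetical stochastic matrices: in A1 node 1 averages with node 0,
# in A2 node 2 averages with node 1.
A1 = [[1.0, 0.0, 0.0],
      [0.5, 0.5, 0.0],
      [0.0, 0.0, 1.0]]
A2 = [[1.0, 0.0, 0.0],
      [0.0, 1.0, 0.0],
      [0.0, 0.5, 0.5]]

# x_2 = A2 (A1 x_0), so the operator over the window is A2 A1.
window = matmul(A2, A1)

tau_1, tau_2, tau_window = tau(A1), tau(A2), tau(window)
print(tau_1, tau_2, tau_window)
```

In the product, node 0 influences every other node, which is the connectivity pattern described in the remark.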

5. The contraction rate in Hopf's oscillation of a linear flow

5.1. Abstract formula for the contraction rate. Hereinafter, we only consider a
finite dimensional vector space $X$. The set of linear transformations on $X$ is denoted by
$\operatorname{End}(X)$. Let $L \in \operatorname{End}(X)$ be such that $L(e) = 0$. The next proposition characterizes the
contraction rate of the flow associated to the linear differential equation

\[ \dot{x} = L(x), \]

with respect to Hopf's oscillation seminorm.

Proposition 5.1. The optimal constant $\alpha$ such that

\[ \|\exp(tL)x\|_H \le e^{\alpha t}\|x\|_H , \qquad \forall t \ge 0,\ x \in X \]

is

\[ h(L) := - \inf_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \inf_{\substack{x \in \operatorname{extr}([0,e]) \\ \langle \nu, x \rangle + \langle \pi, e - x \rangle = 0}} \langle \nu, L(x) \rangle + \langle \pi, L(e - x) \rangle . \tag{23} \]

Proof. Let $I : X \to X$ denote the identity transformation. We define a functional on
$\operatorname{End}(X)$ by:

\[ F(W) = \sup_{\nu,\pi \in \mathcal{P}(e)}\ \sup_{x \in [0,e]} \langle \pi - \nu, W(x) \rangle . \]

By Theorem 3.1 the optimal constant $\alpha$ is:

\[ \begin{aligned}
\alpha &= \lim_{\epsilon \to 0^+} \epsilon^{-1}\big(\|\exp(\epsilon L)\|_H - 1\big) \\
&= \lim_{\epsilon \to 0^+} \epsilon^{-1}\big(F(\exp(\epsilon L)) - F(I)\big) .
\end{aligned} \tag{24} \]

Recall that a map is said to be semidifferentiable at a point if it has one-sided directional
derivatives in all directions, and if the limit defining the one-sided directional derivative is
uniform in the direction, see Definition 7.20 of [RW98], to which we refer for information
on the different notions used here. The limit in (24) coincides with the semiderivative
of $F$ at point $I$ in the direction $L$ if $F$ is semidifferentiable. We next show that it is so,
and compute the limit. Since $\mathcal{P}(e)$ and $[0, e]$ are compact sets and the
function

\[ F_{\nu,\pi,x}(W) = \langle \pi - \nu, W(x) \rangle \]

is continuously differentiable in $W$, with $F_{\nu,\pi,x}(W)$ and $DF_{\nu,\pi,x}(W)$ jointly continuous
in $(\nu, \pi, x, W)$, we know that $F : \operatorname{End}(X) \to \mathbb{R}$ defines a subsmooth function
(see [RW98, Def 10.29]). Therefore $F$ is semidifferentiable and the semiderivative of $F$ at
point $I$ in the direction $L$ equals (see [RW98, Thm 10.30])

\[ DF(I)(L) = \sup_{(\nu,\pi,x) \in T(I)} \langle \pi - \nu, L(x) \rangle \]

where

\[ T(I) = \operatorname*{arg\,max}_{\nu,\pi \in \mathcal{P}(e),\ x \in [0,e]} F_{\nu,\pi,x}(I) . \]

Hence,

\[ \begin{aligned}
\alpha &= DF(I)(L) \\
&= \sup_{\nu,\pi \in \mathcal{P}(e)}\ \sup_{\substack{x \in [0,e] \\ \langle \pi - \nu, x \rangle = 1}} \langle \pi - \nu, L(x) \rangle \\
&= - \inf_{\nu,\pi \in \mathcal{P}(e)}\ \inf_{\substack{x \in [0,e] \\ \langle \nu, x \rangle + \langle \pi, e - x \rangle = 0}} \langle \nu, L(x) \rangle + \langle \pi, L(e - x) \rangle .
\end{aligned} \]

Since $X$ is finite dimensional, the sets $\mathcal{P}(e)$ and $[0, e]$ are both compact, and they are the
convex hull of their extreme points. Hence, arguing as in Remark 3.2 above, we can
replace $\mathcal{P}(e)$ and $[0, e]$ by $\operatorname{extr}\mathcal{P}(e)$ and $\operatorname{extr}([0,e])$, respectively. □



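5.2. Contraction rate in $\mathbb{R}^n$. One may specialize Formula (23) to the case $X = \mathbb{R}^n$,
$C = \mathbb{R}^n_+$ and $e = \mathbf{1}$. For $x \in \mathbb{R}^n$ we denote by $\delta(x)$ the diagonal matrix with entries $x$.

Corollary 5.2. Let $A$ be a square matrix such that $A\mathbf{1} = 0$. Then

\[ h(A) = - \min_{i \neq j} \Big( A_{ij} + A_{ji} + \sum_{k \notin \{i,j\}} \min(A_{ik}, A_{jk}) \Big) . \tag{25} \]

Proof. Recall that

\[ \operatorname{extr}(\mathcal{P}(\mathbf{1})) = \{e_i : i = 1, \dots, n\}, \qquad \operatorname{extr}[0, \mathbf{1}] = \{\mathbf{1}_I : I \subset \{1, \dots, n\}\} . \]

Therefore we have:

\[ \begin{aligned}
h(A) &= - \min_{i \neq j}\ \min_{\substack{I \subset \{1,\dots,n\} \\ i \notin I,\ j \in I}} \Big( \sum_{k \in I} A_{ik} + \sum_{k \notin I} A_{jk} \Big) \\
&= - \min_{i \neq j} \Big( A_{ij} + A_{ji} + \min_{I \subset \{1,\dots,n\} \setminus \{i,j\}} \Big( \sum_{k \in I} A_{ik} + \sum_{k \notin I \cup \{i,j\}} A_{jk} \Big) \Big) \\
&= - \min_{i \neq j} \Big( A_{ij} + A_{ji} + \sum_{k \notin \{i,j\}} \min(A_{ik}, A_{jk}) \Big) .
\end{aligned} \]

□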
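The reduction carried out in this proof can be checked on a small example by comparing the closed form (25) with a direct enumeration over the pairs $(e_i, e_j)$ and the index sets $I$ with $i \notin I$, $j \in I$. The sketch below is an illustration only; the matrix `A` and the function names are hypothetical.

```python
from itertools import combinations

def h_closed_form(A):
    """Formula (25): -min_{i != j} (A_ij + A_ji + sum_{k != i,j} min(A_ik, A_jk))."""
    n = len(A)
    return -min(
        A[i][j] + A[j][i]
        + sum(min(A[i][k], A[j][k]) for k in range(n) if k not in (i, j))
        for i, j in combinations(range(n), 2)
    )

def h_brute_force(A):
    """Enumerate nu = e_i, pi = e_j and subsets I with i not in I, j in I."""
    n = len(A)
    best = float("inf")
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            others = [k for k in range(n) if k not in (i, j)]
            for size in range(len(others) + 1):
                for extra in combinations(others, size):
                    I = set(extra) | {j}
                    value = (sum(A[i][k] for k in I)
                             + sum(A[j][k] for k in range(n) if k not in I))
                    best = min(best, value)
    return -best

# Hypothetical matrix with zero row sums (A 1 = 0).
A = [
    [-3, 1, 2, 0],
    [1, -2, 0, 1],
    [0, 2, -3, 1],
    [1, 1, 1, -3],
]

closed = h_closed_form(A)
brute = h_brute_force(A)
print(closed, brute)
```

The diagonal entries never enter either computation, since $k = i$ always falls outside $I$ and $k = j$ always inside it.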



Remark 5.3. Consider the order-preserving case, i.e. $A_{ij} \ge 0$ for $i \neq j$. Such a situation was
studied extensively in the context of consensus dynamics. In particular, let $G = (V, E)$
be a graph and equip each arc $(i, j) \in E$ with a weight $C_{ij} > 0$ (the node $j$ is connected to $i$).
One of the consensus systems that Moreau [Mor05] studied is:

\[ \dot{x}_i = \sum_{(i,j) \in E} C_{ij}(x_j - x_i), \qquad i = 1, \dots, n . \]

This can be written as $\dot{x} = Ax$, where $A_{ij} = C_{ij}$ for $i \neq j$ and $A_{ii} = -\sum_j C_{ij}$, so that $A$ is a discrete
Laplacian. A general result of Moreau implies that if there is a node connected by path
to all other nodes in the graph $G$, then the system is globally convergent. Our results
show that if $h(C) < 0$ then the system converges exponentially to consensus with rate
$h(C)$. The condition $h(C) = 0$ means that there are two nodes disconnected from each
other ($C_{ij} + C_{ji} = 0$) and all other nodes are connected by arc to at most one of them
($\sum_{k \notin \{i,j\}} \min(C_{ik}, C_{jk}) = 0$). The condition $h(C) < 0$, though more strict than Moreau's
connectivity condition, gives an explicit contraction rate.
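The difference between the condition $h(C) < 0$ and a path-connectivity condition can be seen on two small weighted digraphs. In this sketch, under the convention that $C_{ij} > 0$ means node $i$ listens to node $j$, the weight matrices `star` and `path` are hypothetical examples.

```python
from itertools import combinations

def h(C):
    """-min_{i != j} (C_ij + C_ji + sum_{k != i,j} min(C_ik, C_jk)).

    Diagonal entries are ignored, so the weight matrix C can be used
    directly in place of the Laplacian.
    """
    n = len(C)
    return -min(
        C[i][j] + C[j][i]
        + sum(min(C[i][k], C[j][k]) for k in range(n) if k not in (i, j))
        for i, j in combinations(range(n), 2)
    )

# Star: nodes 1, 2, 3 all listen to node 0.
star = [
    [0, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]

# Path: node 1 listens to node 0, node 2 listens to node 1.
# Node 0 still reaches every node by a path.
path = [
    [0, 0, 0],
    [1, 0, 0],
    [0, 1, 0],
]

h_star = h(star)
h_path = h(path)
print(h_star, h_path)
```

For the path, nodes 0 and 2 are not linked by an arc and share no common neighbour, so the formula returns 0 and gives no contraction rate, even though node 0 is connected by path to all other nodes.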
Remark 5.4. In addition, our result applies to flows that are not necessarily order-preserving. For
example, consider the matrix

\[ A = \begin{pmatrix} -3 & 1 & 2 \\ 1 & 0 & -1 \\ 1 & 1 & -2 \end{pmatrix} . \]

A basic calculation shows that $h(A) = -1$. Therefore, every orbit of the linear system
$\dot{x} = Ax$ converges exponentially with rate $-1$ to a multiple of the unit vector.
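The value $h(A) = -1$ and the resulting decay of the diameter along the flow can be checked numerically. The following sketch is a rough illustration under assumptions: the matrix exponential is approximated by a truncated Taylor series with scaling and squaring (a helper written for this example, not a library routine), and the test vectors are arbitrary.

```python
import math
from itertools import combinations

def h(A):
    """Formula (25) for a square matrix A."""
    n = len(A)
    return -min(
        A[i][j] + A[j][i]
        + sum(min(A[i][k], A[j][k]) for k in range(n) if k not in (i, j))
        for i, j in combinations(range(n), 2)
    )

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matvec(A, x):
    return [sum(a * v for a, v in zip(row, x)) for row in A]

def expm(A, t, squarings=10, terms=25):
    """Approximate exp(tA): Taylor series of exp(tA / 2^s), then s squarings."""
    n = len(A)
    scale = t / (2 ** squarings)
    M = [[A[i][j] * scale for j in range(n)] for i in range(n)]
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in matmul(term, M)]  # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = matmul(result, result)
    return result

def diameter(x):
    return max(x) - min(x)

A = [[-3.0, 1.0, 2.0],
     [1.0, 0.0, -1.0],
     [1.0, 1.0, -2.0]]

rate = h(A)

bound_holds = True
for t in (0.1, 0.5, 1.0, 2.0, 5.0):
    E = expm(A, t)
    for x in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [1.0, -2.0, 3.0]):
        y = matvec(E, x)
        if diameter(y) > math.exp(rate * t) * diameter(x) + 1e-9:
            bound_holds = False

print(rate, bound_holds)
```

The additive tolerance accounts for the approximation error of the hand-written exponential; the bound being tested is the one of Proposition 5.1 with $\alpha = h(A)$.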

Remark 5.5. We point out that as a contraction constant, $h(A)$ makes sense only when
$A\mathbf{1} = 0$. However, as a functional, $h$ is well defined on the space of square matrices.
Moreover, since the diagonal elements do not enter formula (25), it is clear that
for any square matrix $B \in M_n(\mathbb{R})$ and $x \in \mathbb{R}^n$

\[ h(B) = h(B - \delta(x)) . \]

5.3. Contraction in the space of Hermitian matrices. We now specialize Formula (23)
to the case $X = \mathrm{S}_n$, $C = \mathrm{S}_n^+$ and $e = I_n$:

Corollary 5.6. Let $\Phi : \mathrm{S}_n \to \mathrm{S}_n$ be a linear application such that $\Phi(I_n) = 0$. Then

\[ h(\Phi) = - \inf_{\substack{X=(x_1,\dots,x_n) \\ XX^*=I_n}} \Big( x_1^*\Phi(x_2 x_2^*)x_1 + x_2^*\Phi(x_1 x_1^*)x_2 + \sum_{k=3}^n \min\big(x_1^*\Phi(x_k x_k^*)x_1,\ x_2^*\Phi(x_k x_k^*)x_2\big) \Big) , \tag{26} \]

where $x_i$ is the $i$-th column vector of each unitary matrix $X$.
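Proof. Recall that

\[ \operatorname{extr}(\mathcal{P}(I_n)) = \{xx^* : x \in \mathbb{C}^n,\ x^*x = 1\}, \qquad \operatorname{extr}[0, I_n] = \{P \in \mathrm{S}_n : P^2 = P\} . \]

Then,

\[ \begin{aligned}
h(\Phi) &= - \inf_{x_1^*x_1 = x_2^*x_2 = 1}\ \inf_{\substack{P^2 = P \\ Px_1 = 0,\ Px_2 = x_2}} x_1^*\Phi(P)x_1 + x_2^*\Phi(I_n - P)x_2 \\
&= - \inf_{x_1^*x_1 = x_2^*x_2 = 1}\ \inf_{\substack{P = \sum_{i=2}^k x_i x_i^* \\ P^2 = P,\ Px_1 = 0}} \sum_{i=2}^k x_1^*\Phi(x_i x_i^*)x_1 + x_2^*\Phi(I_n - P)x_2 \\
&= - \Big( \inf_{x_1^*x_1 = x_2^*x_2 = 1} x_1^*\Phi(x_2 x_2^*)x_1 + x_2^*\Phi(x_1 x_1^*)x_2 + \inf_{\substack{X=(x_1,x_2,\dots,x_n) \\ XX^* = I_n,\ k}} \sum_{i=3}^k x_1^*\Phi(x_i x_i^*)x_1 + \sum_{i=k+1}^n x_2^*\Phi(x_i x_i^*)x_2 \Big) .
\end{aligned} \]

□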



As pointed out in Remark 5.5, $h$ is a functional well defined for all linear applications
from $\mathrm{S}_n$ to $\mathrm{S}_n$. It is interesting to remark that for any linear application $\Phi$ and any square
matrix $Z$,

\[ h(\Phi) = h(\tilde{\Phi}), \]

where $\tilde{\Phi}(X) = \Phi(X) - ZX - XZ^*$ for all $X \in \mathrm{S}_n$.

5.4. Contraction rate of time-dependent linear flows. We now state the result
analogous to Proposition 5.1, which applies to time dependent linear flows. Let $t_0 > 0$
and $L_\cdot(\cdot) : [0, t_0) \times X \to X$ be a continuous application linear in the second variable
such that $L_t(e) = 0$ for all $t \in [0, t_0)$. We denote by $U(s, t)$ the evolution operator of the
following linear time-varying differential equation:

\[ \dot{x}(t) = L_t(x), \qquad t \in [0, t_0) . \]

Then a slight modification of the proof of Proposition 5.1 leads to the following result.

Proposition 5.7. The optimal constant $\alpha$ such that

\[ \|U(s, t)x\|_H \le e^{\alpha(t - s)}\|x\|_H , \qquad \forall s, t \in [0, t_0),\ x \in X \]

is

\[ \sup_{t \in [0,t_0)} h(L_t) = - \inf_{t \in [0,t_0)}\ \inf_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \inf_{\substack{x \in \operatorname{extr}([0,e]) \\ \langle \nu, x \rangle + \langle \pi, e - x \rangle = 0}} \langle \nu, L_t(x) \rangle + \langle \pi, L_t(e - x) \rangle . \tag{27} \]

6. Contraction rate in Hopf's oscillation seminorm of nonlinear flows

Let us consider a differentiable application $\phi : X \to X$. Since $\phi$ is locally Lipschitz, we
know that for all $x_0 \in X$, there is a maximal interval $J(x_0)$ such that a unique solution
$x(t; x_0)$ of

\[ \dot{x}(t) = \phi(x(t)), \qquad x(0) = x_0 \tag{28} \]

is defined on $J(x_0)$. We define an application $M_\cdot(\cdot) : \mathbb{R} \times X \to X$ by:

\[ M_t(x_0) = x(t; x_0), \qquad t \in J(x_0) . \]

The application $M$ is the flow of the equation (28) and it may not be everywhere defined
on $\mathbb{R} \times X$. Since $\phi$ is continuously differentiable, the flow is differentiable with respect
to the second variable. We denote by $DM_t(x)$ the derivative of the application $M$ with
respect to the second variable at point $(t, x)$. Recall that

\[ \frac{d}{dt} DM_t(x)z = D\phi(M_t(x))(DM_t(x)z), \qquad t \in J(x),\ z \in X . \]

Let $U \subset X$ be a convex open set. For $x_0 \in U$ define:

\[ t_U(x_0) := \sup\{t_0 \in J(x_0) : x(t; x_0) \in U,\ \forall t \in [0, t_0)\} \]

the time when the solution of (28) leaves $U$.

Suppose that $\phi$ satisfies $\phi(x + \lambda e) = \phi(x)$ for all $\lambda \in \mathbb{R}$ and $x \in X$. By uniqueness of
the solution, it is clear that for all $x_0 \in X$ and $\lambda \in \mathbb{R}$,

\[ M_t(x_0 + \lambda e) = M_t(x_0) + \lambda e, \qquad t \in J(x_0) . \]

We define the contraction rate of the flow on $U$ with respect to Hopf's oscillation
seminorm:

\[ \alpha(U) := \inf\{\beta \in \mathbb{R} : \|M_t(x) - M_t(y)\|_H \le e^{\beta t}\|x - y\|_H ,\ x, y \in U,\ t < t_U(x) \wedge t_U(y)\} . \tag{29} \]
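Theorem 6.1. Let $\phi$ satisfy the above conditions. Then we have

\[ \alpha(U) = \sup_{x \in U} h(D\phi(x)) \]

where $h$ is defined in (23).

Proof. Denote

\[ \beta = \sup_{x \in U} h(D\phi(x)) . \]

For any $x \in U$, define

\[ L_t = D\phi(M_t(x)), \qquad t \in [0, t_U(x)) . \]

Let $z \in X$. Then $DM_t(x)z : t \in [0, t_U(x))$ is the solution of the following linear
time-varying differential equation:

\[ \dot{x} = L_t(x), \quad t \in [0, t_U(x)), \qquad x(0) = z . \]

By Proposition 5.7, it is immediate that for all $z \in X$,

\[ \omega(DM_t(x)z/e) \le e^{\beta t}\omega(z/e), \qquad t \in [0, t_U(x)) . \]

Let $x, y \in U$ and $h < t_U(x) \wedge t_U(y)$. Denote $\gamma(s) = sx + (1 - s)y$, $s \in [0, 1]$. Then,

\[ \omega\big((M_h(x) - M_h(y))/e\big) \le \int_0^1 \omega\big(DM_h(\gamma(s))(x - y)/e\big)\,ds \le e^{\beta h}\omega\big((x - y)/e\big) . \]

Therefore, for all $x, y \in U$,

\[ \limsup_{h \to 0^+} \frac{\|M_h(x) - M_h(y)\|_H - \|x - y\|_H}{h} \le \beta\|x - y\|_H . \]

We deduce that for all $x, y \in U$ and $t < t_U(x) \wedge t_U(y)$,

\[ \limsup_{h \to 0^+} \frac{\|M_{t+h}(x) - M_{t+h}(y)\|_H - \|M_t(x) - M_t(y)\|_H}{h} \le \beta\|M_t(x) - M_t(y)\|_H . \]

Therefore,

\[ \|M_t(x) - M_t(y)\|_H \le e^{\beta t}\|x - y\|_H , \qquad t < t_U(x) \wedge t_U(y) . \]

This implies that

\[ \alpha(U) \le \beta . \]

Conversely, for all $x \in U$, there is $t_0 > 0$ such that for all $h \le t_0$, $z \in X$, and $t > 0$ small
enough that $x + tz \in U$,

\[ \|M_h(x + tz) - M_h(x)\|_H \le e^{\alpha(U)h}\|tz\|_H . \]

Therefore,

\[ \|DM_h(x)(z)\|_H = \Big\| \lim_{t \to 0^+} \frac{M_h(x + tz) - M_h(x)}{t} \Big\|_H \le \limsup_{t \to 0^+} \frac{\|M_h(x + tz) - M_h(x)\|_H}{t} \le e^{\alpha(U)h}\|z\|_H . \]

By Theorem 3.1 we obtain that for $h \le t_0$,

\[ \sup_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \sup_{z \in \operatorname{extr}([0,e])} \langle \pi - \nu, DM_h(x)z \rangle \le e^{\alpha(U)h} . \]

Since $DM_h(x)e = e$, it is then immediate that for $h \le t_0$,

\[ \sup_{\nu,\pi \in \operatorname{extr}\mathcal{P}(e)}\ \sup_{\substack{z \in \operatorname{extr}([0,e]) \\ \langle \nu, z \rangle + \langle \pi, e - z \rangle = 0}} - \langle \nu, DM_h(x)z \rangle - \langle \pi, DM_h(x)(e - z) \rangle \le e^{\alpha(U)h} - 1 . \]

Dividing the two sides by $h$ and passing to the limit as $h \to 0^+$ we get:

\[ h(D\phi(x)) \le \alpha(U) . \]

Therefore $\beta \le \alpha(U)$. □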
6.1. Applications to non-linear consensus in $\mathbb{R}^n$. Let $G = (V, E)$ denote a directed
graph. Let us equip every arc $(i, j) \in E$ with a weight $C_{ij} > 0$. For $(i, j) \notin E$, we set
$C_{ij} = 0$.

Example 6.2. (Non linear consensus) Consider the following nonlinear consensus protocol
[SM03]:

\[ \dot{x}_k = \sum_{(i,k) \in E} C_{ik} F_{ik}(x_i - x_k), \qquad k = 1, \dots, n, \tag{30} \]

where we suppose that every map $F_{ik} : \mathbb{R} \to \mathbb{R}$ is differentiable. When every $F_{ik}$ is the
identity map, the operator at the right hand-side of (30) is the discrete Laplacian of the
digraph $G$, in which $C_{ik}$ is the conductivity of arc $(i, k)$.

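Proposition 6.3. Let $w > 0$. Suppose that

\[ \alpha := \inf\{F'_{ik}(t) : t \in [-w, w],\ (i, k) \in E\} \ge 0 . \tag{31} \]

Consider the convex open set

\[ U(w) = \{x : \|x\|_H < w\} . \]

For $x(0) \in U(w)$, the solution of (30) satisfies:

\[ \|x(t)\|_H \le e^{h(C)\alpha t}\|x(0)\|_H , \qquad \forall t \ge 0 . \]

Proof. For all $x \in U(w)$,

\[ h(D\phi(x)) = - \min_{i \neq j} \Big( \frac{\partial \phi_i(x)}{\partial x_j} + \frac{\partial \phi_j(x)}{\partial x_i} + \sum_{k \neq i,j} \min\Big( \frac{\partial \phi_i(x)}{\partial x_k}, \frac{\partial \phi_j(x)}{\partial x_k} \Big) \Big) \]

where

\[ \frac{\partial \phi_i(x)}{\partial x_j} = C_{ij}F'_{ij}(x_j - x_i), \qquad i \neq j . \]

Hence for all $x \in U(w)$,

\[ h(D\phi(x)) \le - \min_{i \neq j} \Big( C_{ij}\alpha + C_{ji}\alpha + \sum_{k \neq i,j} \min(C_{ik}\alpha, C_{jk}\alpha) \Big) = \alpha h(C) . \]

We apply Theorem 6.1 and consider $y = 0$ in the formula (29). Since $\alpha \ge 0$ and $h(C) \le 0$,
we deduce that the set $U(w)$ is invariant. Therefore we conclude. □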

The Kuramoto equation [Str00] is a special case of the protocol (30):

\[ \dot{\theta}_i = \sum_{(i,j) \in E} C_{ij}\sin(\theta_j - \theta_i), \qquad i = 1, \dots, n . \tag{32} \]

Let $w < \pi/2$. Then

\[ \inf\{\cos(t) : t \in [-w, w]\} \ge \cos w > 0 . \]

We apply Proposition 6.3 to obtain that for all $\theta(0)$ such that $\|\theta(0)\|_H < w$, the solution
of (32) satisfies:

\[ \|\theta(t)\|_H \le e^{h(C)\cos(w)\,t}\,\|\theta(0)\|_H , \qquad \forall t \ge 0 . \]

In particular, for all $\theta(0) \in (-\pi/4, \pi/4)^n$, the solution of equation (32) satisfies:

\[ \|\theta(t)\|_H \le e^{h(C)\cos(\|\theta(0)\|_H)\,t}\,\|\theta(0)\|_H , \qquad \forall t \ge 0 . \]
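The exponential bound on the phase diameter can be illustrated by a crude numerical integration of the Kuramoto system. The sketch below uses an explicit Euler scheme with a small step, which is an assumption of this example rather than part of the analysis; the complete graph with unit weights and the initial phases are hypothetical choices.

```python
import math
from itertools import combinations

def h(C):
    """-min_{i != j} (C_ij + C_ji + sum_{k != i,j} min(C_ik, C_jk))."""
    n = len(C)
    return -min(
        C[i][j] + C[j][i]
        + sum(min(C[i][k], C[j][k]) for k in range(n) if k not in (i, j))
        for i, j in combinations(range(n), 2)
    )

def diameter(x):
    return max(x) - min(x)

# Hypothetical complete graph on three nodes with unit weights.
C = [[0.0, 1.0, 1.0],
     [1.0, 0.0, 1.0],
     [1.0, 1.0, 0.0]]
n = len(C)

theta = [0.3, -0.2, 0.1]             # initial phases, diameter 0.5 < pi/2
d0 = diameter(theta)
rate = h(C) * math.cos(d0)           # exponent h(C) cos(||theta(0)||_H)

dt = 1e-3
bound_holds = True
for step in range(1, 2001):
    # Explicit Euler step of d(theta_i)/dt = sum_j C_ij sin(theta_j - theta_i).
    theta = [
        theta[i] + dt * sum(C[i][j] * math.sin(theta[j] - theta[i]) for j in range(n))
        for i in range(n)
    ]
    t = step * dt
    if diameter(theta) > math.exp(rate * t) * d0 + 1e-12:
        bound_holds = False

print(h(C), rate, diameter(theta), bound_holds)
```

With this step size the discrete iteration contracts the diameter at least as fast as the continuous bound, so the check is expected to pass; a much larger step could behave differently.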
Remark 6.4. Moreau [Mor05] showed that if there is a node connected by path to all other
nodes in the graph $(V, E)$, then the system (30) is globally convergent and (32) is globally
convergent on the set $(-\pi/2, \pi/2)^n$. Compared to his results (see Remark 5.3), our
condition for convergence is more strict but we obtain an explicit exponential contraction
rate.

Another class of maps satisfying (31) is $F_{ik}(t) = \arctan(t)$. Consider the following
system

\[ \dot{x}_i = \sum_{(i,j) \in E} C_{ij}\arctan(x_j - x_i), \qquad i = 1, \dots, n . \tag{33} \]

Then we obtain in the same way that for all $x(0) \in \mathbb{R}^n$, the solution of (33) satisfies:

\[ \|x(t)\|_H \le \exp\Big( \frac{h(C)}{1 + \|x(0)\|_H^2}\, t \Big) \|x(0)\|_H , \qquad \forall t \ge 0 . \]

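Example 6.5. (Discrete p-Laplacian) We now analyze the degenerate case of the p-Laplacian
consensus dynamics for $p \in (1, 2) \cup (2, +\infty)$. The latter can be described by the
dynamical system in $\mathbb{R}^n$:

\[ \dot{v}_i = \sum_{(i,j) \in E} C_{ij}(v_j - v_i)\,|C_{ij}(v_i - v_j)|^{p-2} , \qquad i = 1, \dots, n . \]

Let $\alpha > \beta > 0$ and consider the convex open sets

\[ V(\beta) := \{v : \min_{(i,j) \in E} |v_i - v_j| > \beta\}, \qquad U(\alpha) := \{v : \max_{(i,j) \in E} |v_i - v_j| < \alpha\} . \]

A basic calculation shows that for $v \in V(\beta)$ and $i \neq j$,

\[ \frac{\partial \phi_i}{\partial v_j} = \begin{cases} 0, & (i, j) \notin E \\ (p - 1)\,|v_i - v_j|^{p-2}\,C_{ij}^{p-1}, & (i, j) \in E . \end{cases} \]

Let $C^{p-1}$ denote the matrix with entries $C_{ij}^{p-1}$. Recall that $h(C^{p-1}) \le 0$. We have:

\[ h(D\phi(v)) \le \begin{cases} (p - 1)\,h(C^{p-1})\,\alpha^{p-2}, & 1 < p < 2,\ v \in V(\beta) \cap U(\alpha) \\ (p - 1)\,h(C^{p-1})\,\beta^{p-2}, & p > 2,\ v \in V(\beta) . \end{cases} \]

When $1 < p < 2$, the contraction rate on $V(\beta) \cap U(\alpha)$ tends to $-\infty$ while $\alpha$ tends to 0.
When $p > 2$, the contraction rate on $V(\beta)$ tends to 0 while $\beta$ tends to 0. If we fix some
$\beta > \min_{(i,j)} C_{ij}^{-1}$, it can be checked that the contraction rate on $V(\beta)$ tends to $-\infty$
when $p$ tends to $+\infty$.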

7. Contraction rate in Hilbert's metric of non-linear flows

In this section, we apply Theorem 3.1 to determine the contraction rate in Hilbert's
metric of the flow of an ordinary differential equation, still in the finite dimensional case.

7.1. Contraction rate formula in Hilbert's metric. In the following, we consider a
continuously differentiable application $\phi : C^\circ \to X$ such that $\phi(\lambda x) = \lambda\phi(x)$, for all $\lambda > 0$
and $x \in C^\circ$. Note that the latter property implies that

\[ D\phi(x)x = \phi(x) . \]

We denote by $M$ the flow associated to the differential equation (see Section 6 for
notations):

\[ \dot{x} = \phi(x) . \tag{34} \]

By uniqueness of the solution, it is clear that for all $x_0 \in C^\circ$,

\[ M_t(\lambda x_0) = \lambda M_t(x_0), \qquad t \in J(x_0) . \]

Let $U \subset C^\circ$ be a convex open set. Define the optimal contraction rate of the flow in
Hilbert's metric on $U$ by:

\[ \alpha(U) := \inf\{\alpha \in \mathbb{R} : d_H(M_t(x_1), M_t(x_2)) \le e^{\alpha t}d_H(x_1, x_2),\ x_1, x_2 \in U,\ t < t_U(x_1) \wedge t_U(x_2)\} . \tag{35} \]

For $x \in C^\circ$, define:

\[ c(x) := - \inf_{z \in [0,x]}\ \inf_{\substack{\nu,\pi \in \mathcal{P}(x) \\ \langle \pi, z \rangle + \langle \nu, x - z \rangle = 0}} \langle \pi, D\phi(x)z \rangle + \langle \nu, D\phi(x)(x - z) \rangle . \tag{36} \]

For the same reason as in the proof of Proposition 5.1, it follows that

\[ c(x) = - \inf_{z \in \operatorname{extr}[0,x]}\ \inf_{\substack{\nu,\pi \in \operatorname{extr}\mathcal{P}(x) \\ \langle \pi, z \rangle + \langle \nu, x - z \rangle = 0}} \langle \pi, D\phi(x)z \rangle + \langle \nu, D\phi(x)(x - z) \rangle . \tag{37} \]

Theorem 7.1. Let $U \subset C^\circ$ denote a convex open set such that $\lambda U = U$ for all $\lambda > 0$.
Then

\[ \alpha(U) = \sup_{x \in U} c(x) . \tag{38} \]

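Proof. First we prove that for all $x \in U$,

\[ c(x) = \limsup_{t \to 0^+} t^{-1}\Big( \sup_z \frac{\omega(DM_t(x)z/M_t(x))}{\omega(z/x)} - 1 \Big) . \]

For this, fix $x \in U$ and define a functional on a neighborhood of $I$:

\[ F(W) = \sup_{z \in [0,x]}\ \sup_{\nu,\pi \in \mathcal{P}(e)} \Big\langle \frac{\nu}{\langle \nu, W(x) \rangle} - \frac{\pi}{\langle \pi, W(x) \rangle},\ W(z) \Big\rangle . \]

By Theorem 3.1, for $t \in [0, t_U(x))$,

\[ \begin{aligned}
\|DM_t(x)\|_H &= \sup_{z \in [0,x]}\ \sup_{\nu,\pi \in \mathcal{P}(DM_t(x)x)} \langle \nu - \pi, DM_t(x)z \rangle \\
&= \sup_{z \in [0,x],\ \nu,\pi \in \mathcal{P}(e)} \Big\langle \frac{\nu}{\langle \nu, DM_t(x)x \rangle} - \frac{\pi}{\langle \pi, DM_t(x)x \rangle},\ DM_t(x)z \Big\rangle \\
&= F(DM_t(x)) .
\end{aligned} \]

Therefore,

\[ \begin{aligned}
\limsup_{t \to 0^+} t^{-1}\Big( \sup_z \frac{\omega(DM_t(x)z/M_t(x))}{\omega(z/x)} - 1 \Big) &= \lim_{t \to 0^+} t^{-1}\big(\|DM_t(x)\|_H - 1\big) \\
&= \lim_{t \to 0^+} t^{-1}\big(F(DM_t(x)) - F(I)\big) .
\end{aligned} \tag{39} \]

Recall that $DM_t(x) : [0, t_U(x)) \to \operatorname{End}(X)$ satisfies:

\[ \lim_{t \to 0^+} t^{-1}(DM_t(x) - I) = D\phi(x) . \]

The following reasoning is similar to that in the proof of Proposition 5.1. The limit in (39)
equals the semiderivative of $F$ at $I$ in the direction $D\phi(x)$, if this semiderivative exists.
Since $[0, x]$ and $\mathcal{P}(e)$ are compact sets and the function

\[ F_{\nu,\pi,z}(W) = \Big\langle \frac{\nu}{\langle \nu, W(x) \rangle} - \frac{\pi}{\langle \pi, W(x) \rangle},\ W(z) \Big\rangle \]

is continuously differentiable in $W$, with $F_{\nu,\pi,z}$ and the derivative $DF_{\nu,\pi,z}$ jointly
continuous in $(\nu, \pi, z, W)$, we know that $F$ is semidifferentiable. The derivative of $F_{\nu,\pi,z}$
at point $I$ in the direction $D\phi(x)$ is:

\[ \begin{aligned}
DF_{\nu,\pi,z}(I)(D\phi(x)) &= \frac{\langle \nu, D\phi(x)z \rangle\langle \nu, x \rangle - \langle \nu, z \rangle\langle \nu, D\phi(x)x \rangle}{\langle \nu, x \rangle^2} - \frac{\langle \pi, D\phi(x)z \rangle\langle \pi, x \rangle - \langle \pi, z \rangle\langle \pi, D\phi(x)x \rangle}{\langle \pi, x \rangle^2} \\
&= \langle \hat{\nu}, D\phi(x)z \rangle - \langle \hat{\nu}, z \rangle\langle \hat{\nu}, D\phi(x)x \rangle - \langle \hat{\pi}, D\phi(x)z \rangle + \langle \hat{\pi}, z \rangle\langle \hat{\pi}, D\phi(x)x \rangle ,
\end{aligned} \]

where $\hat{\nu} = \nu/\langle \nu, x \rangle$ and $\hat{\pi} = \pi/\langle \pi, x \rangle$. Denote

\[ T(W) = \operatorname*{arg\,max}_{\nu,\pi \in \mathcal{P}(e),\ z \in [0,x]} F_{\nu,\pi,z}(W) . \]

Then

\[ T(I) = \Big\{ \nu, \pi \in \mathcal{P}(e),\ z \in [0, x] : \Big\langle \frac{\nu}{\langle \nu, x \rangle} - \frac{\pi}{\langle \pi, x \rangle},\ z \Big\rangle = 1 \Big\} . \]

The semiderivative of $F$ at point $I$ in the direction $D\phi(x)$ is then:

\[ \begin{aligned}
\lim_{t \to 0^+} t^{-1}\big(F(DM_t(x)) - F(I)\big) &= \sup_{(\nu,\pi,z) \in T(I)} DF_{\nu,\pi,z}(I)(D\phi(x)) \\
&= \sup_{z \in [0,x]}\ \sup_{\substack{\nu,\pi \in \mathcal{P}(x) \\ \langle \nu - \pi, z \rangle = 1}} \langle \nu, D\phi(x)z \rangle - \langle \nu, D\phi(x)x \rangle - \langle \pi, D\phi(x)z \rangle \\
&= c(x) .
\end{aligned} \]

Now fix $x_0 \in U$. By Cauchy-Lipschitz, there is $r > 0$ and $t_0 > 0$ such that the flow is
well-defined on $[0, t_0] \times B(x_0; r)$ where $B(x_0; r)$ is the open ball of radius $r$ centered at
$x_0$. We assume that $B(x_0; r) \subset U$ and consider the set $G := \bigcup_{\lambda > 0} \lambda B(x_0; r)$. For every
$t \le t_0$, the application $M_t$ is well defined on $G$ such that

\[ d_H(M_t(x), M_t(y)) \le e^{\alpha(U)t}d_H(x, y), \qquad \forall x, y \in G . \]

By Theorem 3.8 we have

\[ \omega(DM_t(x)v/M_t(x)) \le e^{\alpha(U)t}\omega(v/x), \qquad \forall x \in G,\ v \in X . \]

Therefore,

\[ c(x_0) = \limsup_{t \to 0^+} \frac{1}{t}\Big( \sup_z \frac{\omega(DM_t(x_0)z/M_t(x_0))}{\omega(z/x_0)} - 1 \Big) \le \alpha(U) . \]

It follows that

\[ \alpha(U) \ge \sup_{x \in U} c(x) . \]

Finally, denote

\[ c = \sup_{x \in U} c(x) . \]

Then for all $x \in U$, $v \in X$ and $t < t_U(x)$,

\[ \begin{aligned}
&\limsup_{h \to 0^+} \frac{\omega(DM_{t+h}(x)v/M_{t+h}(x)) - \omega(DM_t(x)v/M_t(x))}{h} \\
&\quad = \limsup_{h \to 0^+} \frac{\omega\big(DM_h(M_t(x))(DM_t(x)v)/M_h(M_t(x))\big) - \omega(DM_t(x)v/M_t(x))}{h} \\
&\quad = \limsup_{h \to 0^+} \frac{\omega(DM_t(x)v/M_t(x))}{h}\Big( \frac{\omega\big(DM_h(M_t(x))(DM_t(x)v)/M_h(M_t(x))\big)}{\omega(DM_t(x)v/M_t(x))} - 1 \Big) \\
&\quad \le c(M_t(x))\,\omega(DM_t(x)v/M_t(x)) \le c\,\omega(DM_t(x)v/M_t(x)) .
\end{aligned} \]

Therefore, for all $x \in U$, $v \in X$ and $t < t_U(x)$ we have that

\[ \omega(DM_t(x)v/M_t(x)) \le e^{ct}\omega(v/x) . \]

Let $x, y \in U$ and define $\gamma(s) = (1 - s)x + sy$, $0 \le s \le 1$. By the compactness of the set
$\{\gamma(s) : s \in [0, 1]\}$, we know that

\[ t_0 := \inf\{t_U(\gamma(s)) : s \in [0, 1]\} > 0 . \]

Therefore, using the Finsler structure of Hilbert's metric ([Nus94, Thm 2.1]), we get that
for every $t \le t_0$,

\[ \begin{aligned}
d_H(M_t(x), M_t(y)) &\le \int_0^1 \omega\big(DM_t(\gamma(s))(y - x)/M_t(\gamma(s))\big)\,ds \\
&\le \int_0^1 e^{ct}\omega\big((y - x)/\gamma(s)\big)\,ds \\
&= e^{ct}d_H(x, y) .
\end{aligned} \]

Consequently we proved that for all $x, y \in U$

\[ \limsup_{h \to 0^+} \frac{d_H(M_h(x), M_h(y)) - d_H(x, y)}{h} \le c\,d_H(x, y) . \]

This implies that for all $x, y \in U$ and $t < t_U(x) \wedge t_U(y)$:

\[ \limsup_{h \to 0^+} \frac{d_H(M_{t+h}(x), M_{t+h}(y)) - d_H(M_t(x), M_t(y))}{h} \le c\,d_H(M_t(x), M_t(y)) . \]

It follows that

\[ d_H(M_t(x), M_t(y)) \le e^{ct}d_H(x, y), \qquad \forall x, y \in U,\ t < t_U(x) \wedge t_U(y) . \]

Therefore

\[ \alpha(U) \le c . \]

□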



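7.2. Contraction rate in Hilbert's projective metric of a non-linear flow on the
standard positive cone. We specialize the contraction formula (38) to the case $X = \mathbb{R}^n$
and $C = \mathbb{R}^n_+$ under the same notations and assumptions.

Corollary 7.2. When $X = \mathbb{R}^n$ and $C = \mathbb{R}^n_+$, the contraction rate formula (38) can be
specified as below:

\[ \alpha(U) = \sup_{x \in U} c(x) = \sup_{x \in U} h(A(x)) \]

where

\[ A(x) = \delta(x)^{-1}D\phi(x)\delta(x) \]

and $h$ is defined in (25).

Proof. It is sufficient to remark that in this special case:

\[ \operatorname{extr}\mathcal{P}(x) = \delta(x)^{-1}\operatorname{extr}\mathcal{P}(\mathbf{1}), \]

and

\[ \operatorname{extr}[0, x] = \delta(x)\operatorname{extr}([0, \mathbf{1}]) . \]

Therefore

\[ \begin{aligned}
c(x) &= - \inf_{z \in \operatorname{extr}[0,x]}\ \inf_{\substack{\pi,\nu \in \operatorname{extr}\mathcal{P}(x) \\ \langle \nu, z \rangle + \langle \pi, x - z \rangle = 0}} \langle \nu, D\phi(x)z \rangle + \langle \pi, D\phi(x)(x - z) \rangle \\
&= - \inf_{z \in \operatorname{extr}[0,\mathbf{1}]}\ \inf_{\substack{\pi,\nu \in \operatorname{extr}\mathcal{P}(\mathbf{1}) \\ \langle \nu, z \rangle + \langle \pi, \mathbf{1} - z \rangle = 0}} \langle \delta(x)^{-1}\nu, D\phi(x)\delta(x)z \rangle + \langle \delta(x)^{-1}\pi, D\phi(x)\delta(x)(\mathbf{1} - z) \rangle \\
&= h(A(x)) .
\end{aligned} \]

□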

Remark 7.3. Consider the linear flow in $\mathbb{R}^n_+$ of the following equation:

\[ \dot{x} = Ax, \]

where $A_{ij} \ge 0$ for all $i \neq j$, so that the flow is order preserving. Let $x$ be in the interior
of $\mathbb{R}^n_+$. Then we have

\[ \big(\delta(x)^{-1}A\delta(x)\big)_{ij} = A_{ij}\frac{x_j}{x_i}, \qquad i, j = 1, \dots, n . \]

Therefore,

\[ h(\delta(x)^{-1}A\delta(x)) = - \min_{i \neq j} \Big( A_{ij}\frac{x_j}{x_i} + A_{ji}\frac{x_i}{x_j} + \sum_{k \neq i,j} \min\Big( A_{ik}\frac{x_k}{x_i},\ A_{jk}\frac{x_k}{x_j} \Big) \Big) . \]

The global contraction rate (restricted to $C^\circ$) is then

\[ \sup_{x \in C^\circ} h(\delta(x)^{-1}A\delta(x)) = - \min_{i \neq j} 2\sqrt{A_{ij}A_{ji}} . \]

Such a contraction rate can be alternatively obtained by differentiating with respect to $t$
at $t = 0$ the contraction ratio of $I + tA$, using Birkhoff's theorem. Hence a strict global
contraction (a negative rate) exists if and only if $A_{ij} > 0$ for all $i \neq j$. However, strict local contraction
may occur even if $A_{ij} = 0$ for some $i \neq j$. Let $K > 1$ and consider the convex
open set

\[ U(K) = \Big\{ x \in \mathbb{R}^n_+ : \frac{1}{K} < \frac{x_i}{x_j} < K \Big\} . \]

Then the local contraction rate with respect to $U(K)$ satisfies

\[ \sup_{x \in U(K)} h(\delta(x)^{-1}A\delta(x)) \le \frac{h(A)}{K} . \]

Therefore, $h(A) < 0$ is sufficient to have a strict local contraction. Moreover, the above
bound on the contraction rate decreases (faster convergence) as the orbit approaches
consensus, i.e., a multiple of $\mathbf{1}$.
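The quantities in this remark are easy to evaluate for a concrete matrix. The sketch below computes $h(\delta(x)^{-1}A\delta(x))$ at one point $x$ and compares it with the global rate $-\min_{i \neq j} 2\sqrt{A_{ij}A_{ji}}$ and with the elementary bound $h(A)/K$, which follows from $x_j/x_i > 1/K$ when the off-diagonal entries are nonnegative. The matrix `A`, the point `x` and the constant `K` are hypothetical choices for this illustration.

```python
import math
from itertools import combinations

def h(A):
    """Formula (25); diagonal entries are ignored."""
    n = len(A)
    return -min(
        A[i][j] + A[j][i]
        + sum(min(A[i][k], A[j][k]) for k in range(n) if k not in (i, j))
        for i, j in combinations(range(n), 2)
    )

def conjugate(A, x):
    """Entries of delta(x)^{-1} A delta(x): A_ij * x_j / x_i."""
    n = len(A)
    return [[A[i][j] * x[j] / x[i] for j in range(n)] for i in range(n)]

# Hypothetical matrix with nonnegative off-diagonal entries.
A = [[-2.0, 1.0, 1.0],
     [2.0, -3.0, 1.0],
     [1.0, 3.0, -4.0]]
n = len(A)

x = [1.0, 1.5, 0.8]   # a point of the open orthant
K = 2.0
in_U = all(1.0 / K < x[i] / x[j] < K for i in range(n) for j in range(n))

local_rate = h(conjugate(A, x))
global_rate = -2.0 * min(
    math.sqrt(A[i][j] * A[j][i]) for i, j in combinations(range(n), 2)
)
local_bound = h(A) / K

print(local_rate, global_rate, local_bound, in_U)
```

At this point the local value lies below both bounds, as expected: the global rate is a supremum over the whole open cone, and the bound $h(A)/K$ only uses the ratio constraint defining $U(K)$.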

7.3. Application to the space of Hermitian matrices. We specialize the contraction
formula (38) to the case $X = \mathrm{S}_n$ and $C = \mathrm{S}_n^+$ under the same notations and assumptions.

Corollary 7.4. When $X = \mathrm{S}_n$ and $C = \mathrm{S}_n^+$, the contraction rate formula (38) can be
specified as below:

\[ \alpha(U) = \sup_{P \in U} c(P) = \sup_{P \in U} h(\Phi(P)) \]

where $\Phi(P) : \mathrm{S}_n \to \mathrm{S}_n$ is a linear application given by:

\[ \Phi(P)(Z) = P^{-\frac{1}{2}}\,D\phi(P)\big(P^{\frac{1}{2}}ZP^{\frac{1}{2}}\big)\,P^{-\frac{1}{2}} \]

and $h$ is defined in (26).

Proof. Remark that in this special case,

\[ \operatorname{extr}[0, P] = P^{\frac{1}{2}}(\operatorname{extr}[0, I_n])P^{\frac{1}{2}}, \]

and

\[ \operatorname{extr}(\mathcal{P}(P)) = P^{-\frac{1}{2}}(\operatorname{extr}\mathcal{P}(I_n))P^{-\frac{1}{2}} . \]

The desired formula is obtained the same way as in the proof of Corollary 7.2. □

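Example 7.5. As an example, let us compute a contraction rate using Corollary 7.4
for the following differential equation in $\mathrm{S}_n$:

\[ \dot{P} = \phi(P) := AP + PA' - \frac{PBP}{\operatorname{trace}(CP)} \]

where $B, C \in \mathrm{S}_n^+$. Let $P \in (\mathrm{S}_n^+)^\circ$. Then the application $\Phi(P) : \mathrm{S}_n \to \mathrm{S}_n$ defined in
Corollary 7.4 is given by:

\[ \begin{aligned}
\Phi(P)(Z) &= P^{-\frac{1}{2}}\,D\phi(P)\big(P^{\frac{1}{2}}ZP^{\frac{1}{2}}\big)\,P^{-\frac{1}{2}} \\
&= \big({-ZP^{\frac{1}{2}}BP^{\frac{1}{2}}} - P^{\frac{1}{2}}BP^{\frac{1}{2}}Z\big)\operatorname{trace}(CP)^{-1} \\
&\quad + P^{\frac{1}{2}}BP^{\frac{1}{2}}\operatorname{trace}(CP)^{-2}\operatorname{trace}\big(CP^{\frac{1}{2}}ZP^{\frac{1}{2}}\big) + P^{-\frac{1}{2}}AP^{\frac{1}{2}}Z + ZP^{\frac{1}{2}}A'P^{-\frac{1}{2}} .
\end{aligned} \]

Therefore, let $x, y \in \mathbb{C}^n$ be such that $x^*y = 0$; then

\[ y^*\Phi(P)(xx^*)y = y^*P^{\frac{1}{2}}BP^{\frac{1}{2}}y\ \operatorname{trace}\big(CP^{\frac{1}{2}}xx^*P^{\frac{1}{2}}\big)\operatorname{trace}(CP)^{-2} . \]

Let $\{x_1, \dots, x_n\}$ be an orthonormal basis. Denote $\alpha_1 = x_1^*P^{\frac{1}{2}}BP^{\frac{1}{2}}x_1$, $\alpha_2 = x_2^*P^{\frac{1}{2}}BP^{\frac{1}{2}}x_2$,
$\beta_1 = x_1^*P^{\frac{1}{2}}CP^{\frac{1}{2}}x_1$ and $\beta_2 = x_2^*P^{\frac{1}{2}}CP^{\frac{1}{2}}x_2$. Suppose that $\alpha_1 \le \alpha_2$. Then

\[ \begin{aligned}
&x_1^*\Phi(P)(x_2 x_2^*)x_1 + x_2^*\Phi(P)(x_1 x_1^*)x_2 + \sum_{k=3}^n \min\big(x_1^*\Phi(P)(x_k x_k^*)x_1,\ x_2^*\Phi(P)(x_k x_k^*)x_2\big) \\
&\quad = \Big( \alpha_1\beta_2 + \alpha_2\beta_1 + \alpha_1\sum_{k=3}^n x_k^*P^{\frac{1}{2}}CP^{\frac{1}{2}}x_k \Big)\operatorname{trace}(CP)^{-2} \\
&\quad = \big( \alpha_1\beta_2 + \alpha_2\beta_1 + \alpha_1(\operatorname{trace}(CP) - \beta_1 - \beta_2) \big)\operatorname{trace}(CP)^{-2} \\
&\quad = \big( \alpha_1\operatorname{trace}(CP) + \beta_1(\alpha_2 - \alpha_1) \big)\operatorname{trace}(CP)^{-2} \\
&\quad \ge \lambda_{\min}(BP)\operatorname{trace}(CP)^{-1} .
\end{aligned} \]

Therefore by the definition in (26),

\[ h(\Phi(P)) \le -\lambda_{\min}(BP)\operatorname{trace}(CP)^{-1} . \]

Let us consider the convex open set

\[ U = \{P \in (\mathrm{S}_n^+)^\circ : d_H(P, I_n) < K\} . \]

Then,

\[ \begin{aligned}
\sup_{P \in U} h(\Phi(P)) &\le \sup_{P \in U} -\lambda_{\min}(BP)\operatorname{trace}(CP)^{-1} \\
&\le \sup_{P \in U} -\frac{\lambda_{\min}(BP)}{n\lambda_{\max}(CP)} \le \sup_{P \in U} -\frac{\lambda_{\min}(B)\lambda_{\min}(P)}{n\lambda_{\max}(C)\lambda_{\max}(P)} \\
&\le -\frac{\lambda_{\min}(B)}{n\lambda_{\max}(C)e^K} .
\end{aligned} \]

Let $\alpha = -\dfrac{\lambda_{\min}(B)}{n\lambda_{\max}(C)e^K}$. Then by Corollary 7.4, for all $P_1, P_2 \in U$ we have:

\[ d_H(M_t(P_1), M_t(P_2)) \le e^{\alpha t}d_H(P_1, P_2), \qquad 0 \le t < t_U(P_1) \wedge t_U(P_2) . \]

If $A, B, C$ are matrices such that

\[ \phi(I_n) = -B\operatorname{trace}(C)^{-1} + A + A' = \lambda_0 I_n , \]

then we know that

\[ M_t(I_n) = e^{\lambda_0 t}I_n . \]

In that case, for $P \in U$ we have:

\[ d_H(M_t(P), e^{\lambda_0 t}I_n) \le e^{\alpha t}d_H(P, I_n), \qquad 0 \le t < t_U(P) . \]

It follows that $t_U(P) = +\infty$ and therefore every solution of the equation above converges
exponentially to a scalar multiple of $I_n$.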